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Abstract 

During  the  period  of  12/1/2005  -  5/30/2006,  we  expanded  our  research  from  generic  wireless 
sensor  networks  to  radar  sensor  networks.  For  radar  sensor  networks,  we  performed  the  following 
preliminary  studies: 

1.  Waveform  design  and  diversity  in  radar  sensor  networks  with  applications  to  automatic 
target  recognition  without  or  with  delay-doppler  uncertainty.  We  used  constant  frequency 
(CF)  pulse  waveform  and  linear  frequency  modulation  (LFM)  waveform  in  this  study. 

2.  We  proposed  a  Knowledge-based  Ubiquitous  and  Persistent  Sensor  networks  (KUPS)  for 
threat  assessment,  of  which  “sensor”  is  a  broad  characterization  concept.  It  means  diverse 
data  or  information  from  ubiquitous  and  persistent  sensor  sources  such  as  organic  sensors 
(e.g.,  radar)  and  human  intelligence  sensors. 

3.  Spatial-temporal-frequency  diversity  to  improve  the  detection  performance  of  Radar  Sensor 
Networks  in  the  presence  of  certain  types  of  interference  (clutter,  jamming,  noise  and 
interference  between  radar  sensors)  was  studied. 

For  non-radar  sensor  networks,  we  continuously  conducted  the  following  research  tasks: 

1.  Channel  Capacity  of  Virtual  MIMO-Based  Wireless  Sensor  Networks  with  Imperfect  CSI; 

2.  Cross-Layer  Design  for  MIMO-Based  Wireless  Sensor  Networks; 

3.  Statistical  Analysis  in  Wireless  Sensor  Networks  with  Application  to  Resources  Allocation; 

4.  MAC  Protocol  Design  for  UWB-Based  Wireless  Sensor  Networks; 

5.  and  Query  Processing  Optimization  in  Wireless  Sensor  Networks. 

Fourteen  papers  were  produced  during  the  past  six  months,  and  are  attached  to  this  report. 


1  Studies  on  Radar  Sensor  Networks 

1.1  Waveform  Design  and  Diversity  in  Radar  Sensor  Networks 

In  [1]  [2] ,  we  performed  some  theoretical  studies  on  constant  frequency  (CF)  pulse  waveform  design 
and  diversity  in  radar  sensor  networks  (RSN):  (1)  the  conditions  for  waveform  co-existence,  (2)  in¬ 
terferences  among  waveforms  in  RSN,  (3)  waveform  diversity  combining  in  RSN.  As  an  application 
example,  we  applied  the  waveform  design  and  diversity  to  automatic  target  recognition  (ATR)  in 
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RSN  and  propose  maximum-likehood  (ML)-ATR  algorithms  for  nonfluctuating  target  as  well  as 
fluctuating  target.  Simulation  results  show  that  our  waveform  diversity-based  ML-ATR  algorithm 
performs  much  better  than  single-waveform  ML-ATR  algorithm  for  nonfluctuating  targets  or  fluc¬ 
tuating  targets.  Conclusions  are  drawn  based  on  our  analysis  and  simulations  and  future  research 
works  on  this  research  topic  are  discussed. 

In  the  above  ATR  using  CF  pulse  waveform  design  and  diversity,  we  assumed  no  delay-doppler 
uncertainty.  It  is  not  true  for  ATR  in  target  search  phase  because  the  target  range  and  mobility 
are  not  yet  perfectly  known,  which  results  in  delay-doppler  uncertainty.  In  [3]  [4],  we  studied  linear 
frequency  modulation  (LFM)  waveform  design  and  diversity,  and  applied  it  to  ATR  with  delay- 
doppler  uncertainty  using  ML-ATR  algorithm.  Simulation  results  show  that  our  RSN  vastly  reduces 
the  ATR  error  comparing  to  a  single  radar  system  in  ATR  with  delay-doppler  uncertainty. 

In  [6] ,  we  studied  waveform  design  and  diversity  using  some  concepts  from  physical  layer  commu¬ 
nications  such  as  orthogonal,  non-coherent  detection,  and  coherent  detection.  We  also  used  signal 
representation  from  communication  domain,  i.e.,  taking  its  real  part  when  carriers  axe  considered. 
We  proposed  orthogonal  waveforms  for  RSN,  which  eliminates  interference  when  no  doppler  shift 
is  introduced.  Additionally,  this  approach  applies  the  advantage  of  spacial  diversity  through  equal 
gain  combination  performed  by  clusterhead.  When  doppler  shift  is  considered  and  interference  is 
unavoidable,  we  analyzed  the  performance  of  this  design  not  only  in  coherent  RSN,  but  in  nonco¬ 
herent  systems  as  well.  The  latter  scenario  is  more  challenging  as  doppler-shift  uncertainty  results 
in  more  complicated  implementation.  Monte  Carlo  simulation  shows  that  our  technique  provides 
much  better  detection  performance  than  single  radar  for  fluctuating  targets,  in  terms  of  probability 
of  false  alarm  and  miss  detection.  Conclusions  are  drawn  based  on  our  analysis  and  further  related 
research  areas  are  discussed. 

1.2  A  Network  Centric  Warfare  (NCW)  Model:  Knowledge-based  Ubiquitous 
and  Persistent  Sensor  Networks  for  Threat  Analysis 

In  current  and  future  military  operational  environments,  such  as  Global  War  on  Terrorism  (GWOT) 
and  Maritime  Domain  Awareness  (MDA),  warfighters  require  technologies  evolved  to  support  infor¬ 
mation  needs  regardless  of  location  and  consistent  with  the  users  level  of  command  or  responsibility 
and  operational  situation.  To  support  this  need,  the  DoD  has  developed  the  concept  of  Network 
Centric  Warfare  (NCW),  defined  as  “ military  operations  that  exploit  state-of-the-art  information 
and  networking  technology  to  integrate  widely  dispersed  human  decision  makers,  situational  and 
targeting  sensors,  and  forces  and  weapons  into  a  highly  adaptive,  comprehensive  system  to  achieve 
unprecedented  mission  effectiveness 

In  the  spirit  of  this  NCW  concept,  in  [5],  we  proposed  a  Knowledge-based  Ubiquitous  and  Per¬ 
sistent  Sensor  networks  (KUPS)  for  threat  assessment,  of  which  “sensor”  is  a  broad  characterization 
concept.  It  means  diverse  data  or  information  from  ubiquitous  and  persistent  sensor  sources  such 
as  organic  sensors  and  human  intelligence  sensors.  Our  KUPS  for  threat  assessment  consists  of 
two  major  steps:  threat  detection  using  fuzzy  logic  systems  and  threat  parameter  estimation  using 
radar  sensor  networks.  Our  fuzzy  logic  systems  can  combine  the  linguistic  knowledge  from  differ¬ 
ent  intelligent  sensors.  We  proposed  a  maximum-likelihood  (ML)  estimation  algorithm  for  target 
RCS  parameter  estimation,  and  we  showed  that  our  ML  estimator  is  unbiased  and  the  variance 
of  parameter  estimation  matches  the  Cramer-Rao  lower  bound.  Simulations  further  validate  these 
theoretical  results. 
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1.3  Spatial-Temp  oral- Frequency  Diversity  in  Radar  Sensor  Networks 

In  [7],  spatial-temporal- frequency  diversity  to  improve  the  detection  performance  of  Radar  Sensor 
Networks  in  the  presence  of  certain  types  of  interference  (clutter,  jamming,  noise  and  interference 
between  radar  sensors)  was  studied.  Besides  the  interference  between  radar  sensors,  performance 
of  the  network  depends  largely  on  other  interference,  especially  clutter,  which  is  extended  in  both 
angle  and  range,  and  is  spread  in  Doppler  frequency.  By  using  the  spatial-temporal  diversity,  we 
can  suppress  effects  of  these  interference.  In  [7],  we  also  proposed  a  receiver  for  diversity  combining 
in  RSN.  As  an  application  example,  we  applied  the  spatial-temporal-frequency  diversity  scheme  to 
improve  the  detection  performance  or  reduce  the  miss-detection  probability  at  a  low  false  alarm 
probability.  Simulation  results  for  both  non-fluctuating  targets  and  fluctuating  targets  show  that 
the  performance  of  our  proposed  scheme  is  superior  to  that  of  the  single  radar  with  the  spatial- 
temporal  diversity  only. 

2  Studies  on  Non-Radar  Sensor  Networks 

2.1  Channel  Capacity  of  Virtual  MIMO-Based  Wireless  Sensor  Networks  with 
Imperfect  CSI 

Multiple-input  multiple-output  (MIMO)  has  recently  emerged  as  one  of  the  most  significant  tech¬ 
nical  breakthroughs  in  modern  communications.  A  key  feature  of  MIMO  systems  is  the  ability 
to  turn  multipath  propagation,  traditionally  a  pitfall  of  wireless  transmission,  into  a  benefit  for 
the  user.  In  existing  works  on  MIMO  for  wireless  sensor  networks,  virtual  MIMO  schemes  based 
on  the  space-time  block  codes  (STBC)  have  been  widely  used,  and  it’s  demonstrated  that  the 
full  diversity  and  full  rate  are  achieved  which  enhances  power/bandwidth  efficiency  and  reliability. 
However,  all  the  existing  works  have  generated  substantial  insight  into  the  applications  of  MIMO 
to  wireless  sensor  networks  design  for  certain  simplified  models  of  the  communication  environment, 
such  as  complete  channel  state  information  (CSI),  absence  of  interference  generated  by  other  users 
in  the  network,  and  persistent,  homogeneous  traffic  from  the  physical  layer  point  of  view.  In  [8], 
we  studied  channel  capacity  of  virtual  MIMO-based  wireless  sensor  networks  with  imperfect  CSI. 
We  compared  the  channel  capacities  of  using  equal  power  allocation  and  waterfilling  strategy  and 
considered  different  channel  models  with  imperfect  CSI. 

2.2  Cross-Layer  Design  for  MIMO-Based  Wireless  Sensor  Networks 

The  multiple-input  and  multiple-output  (MIMO)  system  can  be  used  to  increase  throughtput 
through  multiplexing  or  to  improve  PLR  (Packet  Loss  Ratio)  throught  diversity.  Besides  phys¬ 
ical  layer  factors  (such  as  diversity  gain,  bandwidth,  and  SNR),  the  MAC  layer  and  network  layer 
protocols  also  affect  the  throughput  and  PLR,  for  example,  larger  number  of  transmitters  will  lead 
to  higher  link  failure  which  may  require  MAC  layer  re-transmission  and  network  layer  re-routing. 
In  [9],  we  studied  cross-layer  design  for  MIMO  system  considering  physical  layer,  MAC  layer  and 
network  layer  design.  Simulation  results  show  that  theoretical  result  based  on  pure  physical  layer 
design  such  as  larger  number  of  transmitters  have  better  diversity  gain  and  multiplexing  gain  does 
not  hold  when  the  MAC  layer  and  network  layer  are  also  considered,  which  indicates  that  some 
theoretical  results  on  MIMO  may  need  to  be  revisited. 


3 


2.3  Statistical  Analysis  in  Wireless  Sensor  Networks  with  Application  to  Re¬ 
sources  Allocation 

In  [11],  we  modeled  the  end-to-end  distance  for  a  given  number  of  hops  in  dense  planar  Wireless 
Sensor  Networks.  We  derived  that  the  closed-form  formula  for  single- hop  distance  and  postulate 
Beta  distribution  for  2-hop  distance.  When  the  number  of  hops  increases  beyond  three,  the  multihop 
distance  approaches  Gaussian.  The  Gaussian  approximation  model  is  also  applied  to  ranging,  which 
achieves  less  distance  error  than  Hop-TERRAIN  and  APS  (Ad  hoc  Positioning  System).  Our  error 
analysis  also  shows  the  distance  error  is  be  minimized  by  using  our  model. 

In  [10],  we  addressed  a  fundamental  problem  in  Wireless  Sensor  Networks,  how  many  hops 
does  it  take  for  a  packet  to  be  relayed  for  a  given  distance?  For  a  deterministic  topology,  this 
question  reduces  to  a  simple  geometry  problem.  However,  a  statistical  study  is  needed  for  randomly 
deployed  WSNs.  We  proposed  a  Maximum  Likelihood  decision  based  on  the  conditional  pdf  of 
f{r\Hi),  Due  to  the  computational  complexity  of  / (r\Hl),  we  also  proposed  an  attenuated  Gaussian 
approximation  for  the  conditional  pdf.  We  showed  that  the  approximation  visibly  simplifies  the 
decision  process  and  the  error  analysis.  The  latency  and  energy  consumption  estimation  are  also 
included  as  application  examples.  Simulations  show  that  our  approximation  model  can  predict  the 
latency  and  energy  consumption  with  less  than  half  RMSE,  compared  to  the  linear  models. 

2.4  MAC  Protocol  Design  for  UWB-Based  Wireless  Sensor  Networks 

In  [12],  we  proposed  a  MAC  protocol:  throughput-maximized  MAC  protocol  (TM-MAC),  based 
on  the  characteristics  of  ultra  wideband  (UWB)  technology.  In  UWB  communication  systems,  the 
transmission  parameters  are  tunable  to  match  the  requirements  of  data  flow.  In  TM-MAC,  we 
implement  concurrent  multiuser  access  instead  of  mutual  exclusion  method,  such  as  TDMA  and 
random  access.  For  multiuser  interference,  we  established  a  model  to  adaptively  adjust  the  data 
transmission  rate  to  ensure  a  satisfied  signal  to  noise  ratio  (SNR)  at  receiver  side.  We  also  analyzed 
the  relationship  among  the  theoretical  maximum  channel  capacity,  the  achievable  maximum  channel 
capacity  and  the  maximum  data  transmission  rate.  According  to  the  network  topology,  TM- 
MAC  re-divides  network  into  subsets,  in  which  communication  pairs  can  make  communication 
simultaneously  to  enhance  throughput  and  to  exploit  as  fast  as  possible  data  transmission  rate  for 
reliable  communication.  For  subset  formation,  we  proposed  a  general  analytical  framework,  which 
captures  the  unique  characteristics  of  shared  wireless  channel  and  allows  to  model  a  large  class 
of  systemwide  throughputmaximization  issue  via  the  specification  of  per-link  utilization  functions. 
Simulation  results  demonstrate  that  TM-MAC  can  implement  throughput  maximization  to  shorten 
latency  and  to  enhance  network  processing  capability. 

2.5  Query  Processing  Optimization  in  Wireless  Sensor  Networks 

Query  processing  has  been  studied  extensively  in  traditional  database  systems.  But  few  existing 
methods  can  be  directly  applied  to  wireless  sensor  database  systems  due  to  their  characteristics,  such 
as  decentralized  nature,  limited  computational  power,  imperfect  information  recorded,  and  energy 
scarcity  of  individual  sensor  nodes.  In  [13],  we  extended  our  previous  work:  quality-guaranteed 
and  energy-efficient  algorithm  (QGEE)  for  wireless  sensor  database  systems.  We  introduced  radius 
of  covering  disk  from  point  spread  function  (PSF)  aspect  and  sample  size  for  query  quality  and 
energy  consumption  control.  PSF  introduces  ambiguity  into  query  answers,  since  the  sensitivity  of 
nodes  is  nonuniform  within  monitoring  region.  Sample  size  determination  refers  to  the  process  of 
determining  exactly  how  many  samples  should  be  measured  in  order  that  the  sampling  distribution 
of  estimators  meets  users’  pre-specified  target  precision.  In  this  paper,  we  formulated  the  criteria  to 
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determine  the  optimum  radius  and  sample  size  according  to  users’  requirements  on  query  answers. 
Simulation  results  demonstrate  that  the  impact  of  sample  size  and  monitoring  coverage  on  query 
answers  in  terms  of  root  mean  square  error  (RMSE). 

In  [14],  we  proposed  two  methods  to  substitute  cosine  measure  for  vector  similarity:  Cosine- 
Length  Measure  (CLM)  and  Joint-Deference  Measure  (JDM).  Through  considering  the  impact  of 
vector  length  on  vector  similarity,  CLM  alleviates  the  disadvantage  of  traditional  VSM,  in  which  the 
confidence  of  query  answer  may  be  degraded  since  truly  similar  nodes  cannot  be  elected  according  to 
users’  requirement.  JDM  upgrades  the  accuracy  and  degrades  the  complexity  for  the  computation 
on  similarity  coefficient  through  simplifying  the  measure  from  vector  domain  to  scalar  domain. 
In  addition,  with  the  distributions  of  measurement  error  and  environment  noise  known  and/or 
unknown  respectively,  we  formulated  the  criteria  to  determine  the  optimum  sample  size  to  meet 
users’  pre-specified  target  precision.  Through  simulation,  we  checked  the  validities  and  sensitivities 
of  cosine  measure,  CLM  and  JMD  methods  on  answer  quality  and  network  lifetime.  Furthermore, 
our  simulation  results,  in  this  paper,  form  a  set  of  criteria  for  method  selection  based  on  specific 
applications. 
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Abstract 

In  this  paper,  we  perform  some  theoretical  studies  on  constant  frequency  (CF)  pulse  wave¬ 
form  design  and  diversity  in  radar  sensor  networks  (RSN):  (1)  the  conditions  for  waveform 
co-existence,  (2)  interferences  among  waveforms  in  RSN,  (3)  waveform  diversity  combining  in 
RSN.  As  an  application  example,  we  apply  the  waveform  design  and  diversity  to  automatic 
target  recognition  (ATR)  in  RSN  and  propose  maximum-likehood  (ML)-ATR  algorithms  for 
nonfluctuating  target  as  well  as  fluctuating  target.  Simulation  results  show  that  our  waveform 
diversity-based  ML-ATR  algorithm  performs  much  better  than  single-waveform  ML-ATR  algo¬ 
rithm  for  nonfluctuating  targets  or  fluctuating  targets.  Conclusions  are  drawn  based  on  our 
analysis  and  simulations  and  future  research  works  on  this  research  topic  are  discussed. 

Index  Terms  :  radar  sensor  networks,  waveform  diversity,  automatic  target  recognition, 
maximum-likelihood,  interferences,  ambiguity  function. 
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1  Introduction  and  Motivation 


The  network  of  radar  sensors  should  operate  with  multiple  goals  managed  by  an  intelligent  platform 
network  that  can  manage  the  dynamics  of  each  radar  to  meet  the  common  goals  of  the  platform, 
rather  than  each  radar  to  operate  as  an  independent  system.  Therefore,  it  is  significant  to  perform 
signal  design  and  processing  and  networking  cooperatively  within  and  between  platforms  of  radar 
sensors  and  their  communication  modules. 

Waveform  diversity  is  the  technology  that  will  allow  one  or  more  sensors  on  board  a  platform 
to  automatically  change  operating  parameters,  e.g.,  frequency,  gain  pattern,  and  pulse  repetition 
frequency  (PRF)  to  meet  the  varying  environments.  It  has  long  been  recognized  that  judicious  use 
of  properly  designed  waveforms,  coupled  with  advanced  receiver  strategies,  is  fundamental  to  fully 
utilizing  the  capacity  of  the  electromagnetic  spectrum.  However,  it  is  only  relatively  recent  advances 
in  hardware  technology  that  are  enabling  a  much  wider  range  of  design  freedoms  to  be  explored.  As 
a  result,  there  are  emerging  and  compelling  changes  in  system  requirements  such  as  more  efficient 
spectrum  usage,  higher  sensitivities,  greater  information  content,  improved  robustness  to  errors, 
reduced  interference  emissions,  etc.  The  combination  of  these  is  fuelling  a  worldwide  interest  in  the 
subject  of  waveform  design  and  the  use  of  waveform  diversity  techniques. 

In  the  existing  works  on  waveform  design  and  selection,  Fitzgerald  [5]  demonstrated  the  inap¬ 
propriateness  of  selection  of  waveform  based  on  measurement  quality  alone:  the  interaction  between 
the  measurement  and  the  track  can  be  indirect,  but  must  be  accounted  for.  Bell  [3]  used  infor¬ 
mation  theory  to  design  radar  waveform  for  the  measurement  of  extended  radar  targets  exhibiting 
resonance  phenomena.  In  [2],  singularity  expansion  method  was  used  to  design  some  discriminant 
waveforms  such  as  K-pulse,  E-pulse,  and  S-pulse.  Sowelam  and  Tewfik  [20]  developed  a  signal  selec¬ 
tion  strategy  for  radar  target  classification,  and  a  sequential  classification  procedure  was  proposed 
to  minimize  the  average  number  of  necessary  signal  transmissions.  Intelligent  waveform  selection 
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was  studied  in  [1] [9],  but  the  effect  of  doppler  shift  was  not  considered.  In  [11],  the  performance  of 
constant  frequency  (CF)  and  linear  frequency  modulated  (LFM)  waveform  fusion  from  the  stand¬ 
point  of  the  whole  system  was  studied,  but  the  effects  of  clutter  was  not  considered.  In  [19],  CF  and 
LFM  waveforms  were  studied  for  sonar  system,  but  it  was  assumed  that  the  sensor  is  nonintelligent 
(i.e.,  waveform  can’t  be  selected  adaptively).  All  the  above  studies  and  design  methods  were  focused 
on  the  waveform  design  or  selection  for  a  single  active  radar  or  sensor.  In  [17],  cross-correlation 
properties  of  two  radars  are  briefly  mentioned  and  the  binary  coded  pulses  using  simulated  anneal¬ 
ing  [4]  are  highlighted.  However,  the  cross-correlation  of  two  binary  sequences  such  as  binary  coded 
pulses  (e.g.  Barker  sequence)  are  much  easier  to  study  than  that  of  two  analog  radar  waveforms. 
In  this  paper,  we  will  focus  on  the  waveform  diversity  and  design  for  radar  sensor  networks  using 
constant  frequency  (CF)  pulse  waveform. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  2,  we  study  the  co-existence  of 
radar  waveforms.  In  Section  3,  we  analyze  the  interferences  among  radar  waveforms.  In  Section  4 
we  propose  a  RAKE  structure  for  waveform  diversity  combining  and  propose  maximum-likelihood 
(ML)  algorithms  for  automatic  target  recognition  (ATR).  In  Section  5,  we  provide  simulation  results 
on  ML-ATR.  In  Section  6,  we  conclude  this  paper  and  provide  some  future  works. 

2  Co-existence  of  Radar  Waveforms 

In  radar  sensor  networks  (RSN),  radar  sensors  will  interfere  with  each  other  and  the  signal-to- 
interference-ratio  may  be  very  low  if  the  waveforms  are  not  properly  designed.  We  will  introduce 
orthogonality  as  one  criterion  for  waveforms  design  in  RSN  to  make  them  co-existence.  Besides, 
the  radar  channel  is  narrow-band,  so  we  will  also  consider  the  bandwidth  constraint. 

In  our  radar  sensor  networks,  we  choose  CF  pulse  waveform.  The  CF  pulse  waveform  can  be 
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defined  as 


x(t) 


exp(j2n(3t)  —  T( 2  <t<  T/2 


(1) 


In  radar,  ambiguity  function  (AF)  is  an  analytical  tool  for  waveform  design  and  analysis  that 
succinctly  characterizes  the  behavior  of  a  waveform  paired  with  its  matched  filter.  The  ambiguity 
function  is  useful  for  examing  resolution,  side  lobe  behavior,  and  ambiguities  in  both  range  and 
Doppler  for  a  given  waveform[15].  For  a  single  radar,  the  matched  filter  for  waveform  x(t)  is  a :*(— t), 
and  the  ambiguity  function  of  CF  pulse  waveform  is 


A(t,Fd)  = 


r T /2 

I  x(t)  exp  (j27rF£)s)x*(t  —  r)dt 

J -T/2  •  t 

jFsin[7rFp(T  —  |r| )] 

TttFd 


-T<t<T 


(2) 


Three  special  cases  can  simplify  this  AF : 


1.  When  r  =  0, 


2.  and  when  Fd  =  0, 


A(0,Fd) 


E  sin(7r  FqT) 
Tk{Fd) 


A{t,  0) 


E(T-\t\) 

T 


3.  and 

.4(0,0)  =  E 


(3) 

(4) 

(5) 


However,  the  above  ambiguity  is  for  one  radar  only  (no  co-existing  radar). 

For  radar  sensor  networks,  the  waveforms  from  different  radars  will  interfere  with  each  other. 
We  choose  the  waveform  for  radar  i  as 


Xi(t)  =  \  —  exp[j2n(P  +  <J,)f]  -  T/2  <  t  <  T/2 


(6) 
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which  means  there  is  a  frequency  shift  Si  for  radar  i.  To  minimize  the  interference  from  one  waveform 
to  the  other,  optimal  values  for  Si  should  be  determined  to  have  the  waveforms  orthogonal  to  each 
other,  i.e.,  let  the  cross-correlation  between  Xi(t)  and  xn(t)  be  0, 


E  fT /2 

—  /  exp{j2n{P  +  Si)t]exp[-j2n(P  +  Sn)t\dt 
1  J-T/2 

£'sinc[7r(<5j  —  <5„)T] 


(7) 

(8) 


If  we  choose 


(9) 


where  i  is  a  dummy  index,  then  (8)  can  have  two  cases 


E  i  =  n 

< 

0 


(10) 


So  choosing  Si  =  ^  in  (6)  can  have  orthogonal  waveforms,  i.e.,  the  waveforms  can  co-exist  if  the 
carrier  spacing  is  1/T  between  two  radar  waveforms,  i.e.,  orthogonality  amongst  carriers  can  be 
achieved  by  separating  the  carriers  by  an  interger  multiple  of  the  inverse  of  waveform  pulse  duration. 
With  this  design,  all  the  orthogonal  waveforms  can  work  simultaneously.  However,  there  may  exist 
time  delay  and  doppler  shift  ambiguity  which  will  have  interferences  to  other  waveforms  in  RSN. 


3  Interferences  of  Waveforms  In  Radar  Sensor  Networks 

3.1  RSN  with  Two  Radar  Sensors 

We  are  interested  in  analyzing  the  interference  from  one  radar  to  another  if  there  exist  time  delay 
and  doppler  shift.  For  a  simple  case  where  there  are  two  radar  sensors  (i  and  n),  the  ambiguity 
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function  of  radar  i  (considering  interference  from  radar  n)  is 


FDi,FDn) 


< 


/OO 

[xj(i)  exp(j2nFDit)  +  xn(t  -  tn )  exp(j2irFDnt)\x* ( t  -  ti)dt\  (11) 

-OO 

rT/24-m  in 

j  xn(t  -  tn )  exp(j2nFDnt)x* {t  -  U)dt 


+ 


■T/2+max(t,-,tn) 
>■ r/2 


/ 


/  Xi(£) exp ( j2nFDit)x * ( t  -  U)dt 

J-T/2+ti 

xn(t  -  tn )  ex.p(j2irFDnt)x*  ( t  -  U)dt 


T/2+t, 
T/2+min(ti,tn) 


+ 


T/2+max(ti,tn) 

Esin[nFDi(T  -  |ij 


TnFDi 


To  make  analysis  easier,  we  assume  t,  =  tn  =  r,  then  (13)  can  be  simplified  as 


Ai(T,FDi,FDn)  «  |Esinc[7r(n-i  +  JFD„T)]|  + 


Some  special  cases  of  (14)  are  listed  as  follows: 


E sm[n FDi(T  -\t\)} 


T-kFd  i 


(12) 


(13) 


(14) 


1.  If  Fd i  =  Fpn  =  0,  and  Si  and  Sn  follow  (9),  then  (14)  becomes 

\E(T-\t\)}\ 


Ai(r,  0,0) 


(15) 


2.  If  r  =  0,  then  (14)  becomes 


Ai(0,FDi,FDn)  «  |£lsinc[7r(n  -  i  +  FDnT))\  + 


Esm(nFDiT) 


T-jrFDi 


(16) 


3.  If  Fd-  —  Fpn  =  0,  r  =  0,  and  5,  and  Sn  follow  (9),  then  (14)  becomes 


(0,0,0)  ?zE 


(17) 


3.2  RSN  with  M  Radar  Sensors 

It  can  be  extended  to  an  RSN  with  M  radars.  Assuming  time  delay  r  for  each  radar  is  the  same, 
then  the  ambiguity  function  of  radar  1  (considering  interferences  from  all  the  other  M  —  1  radars 
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with  CF  pulse  waveforms)  can  be  expressed  as 


M 


Ai(t,  FDi  ,  ■  •  •  ,  FDm)  «  |£sinc[7r(i  -  1  +  FDiT)}\  + 


*= 2 


-B sin^Fcj (T  -  |r|)] 


T-kFd j 


(18) 


Similarly,  we  can  have  three  special  cases, 


1.  If  Fd1  =  Fjj2  —  ■  ■  ■  =  Fdm  —  0,  and  frequency  shift  <5,  in  (6)  for  each  radar  follows  (9),  then 
(18)  becomes 


Ai(t,0,  (),•••  ,0) 


E(T-  |r|)] 


(19) 


comparing  it  against  (3),  it  shows  that  our  derived  condition  in  (6)  can  have  a  radar  in  RSN 
get  the  same  signal  strength  as  that  of  a  single  radar  (no  co-existing  radar)  when  the  doppler 
shift  is  0. 


2.  If  r  =  0,  then  (18)  becomes 

Ai (0,  FDl ,  FD2,  ■  ■  •  ,  FDm )  «  |£sinc[7r(i  -  1  +  FDiT)} \  + 


M 

E 

i=i 


£  sin(7r  FdjT) 


(20) 


TttFd1 

Comparing  to  (4),  a  radar  in  RSN  has  more  interferences  when  unknown  doppler  shifts  exist. 
3.  Fq !  =  Fd2  —  ■  ■  •  =  Fdm  =  0,  r  =  0,  and  <5j  in  (6)  follows  (9),  then  (18)  becomes 


^(0,0,0, •••  ,0  )*E 


(21) 


4  Waveform  Diversity  and  Combining  with  Application  to  Auto¬ 
matic  Target  Recognition 

In  RSN,  The  radar  sensors  are  networked  together  in  an  ad  hoc  fashion.  They  do  not  rely  on  a 
preexisting  fixed  infrastructure,  such  as  a  wireline  backbone  network  or  a  base  station.  They  are  self¬ 
organizing  entities  that  are  deployed  on  demand  in  support  of  various  events  surveillance,  battlefield, 
disaster  relief,  search  and  rescue,  etc.  Scalability  concern  suggest  a  hierarchical  organization  of  radar 
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sensor  networks  with  the  lowest  level  in  the  hierarchy  being  a  cluster.  As  argued  in  [10]  [7]  [6]  [13], 
in  addition  to  helping  with  scalability  and  robustness,  aggregating  sensor  nodes  into  clusters  has 
additional  benefits: 

1.  conserving  radio  resources  such  as  bandwidth; 

2.  promoting  spatial  code  reuse  and  frequency  reuse; 

3.  simplifying  the  topology,  e.g.,  when  a  mobile  radar  changes  its  location,  it  is  sufficient  for 
only  the  nodes  in  attended  clusters  to  update  their  topology  information; 

4.  reducing  the  generation  and  propagation  of  routing  information;  and, 

5.  concealing  the  details  of  global  network  topology  from  individual  nodes. 

In  RSN,  each  radar  can  provide  their  waveform  parameters  such  as  5*  to  their  clusterhead  radar, 
and  the  clusterhead  radar  can  combine  the  waveforms  from  its  cluster  members. 

In  RSN  with  M  radars,  the  received  signal  for  clusterhead  (assume  it’s  radar  1)  is 

M 

rq  (u,t)  =  '^^a(u)xi{t  —  ti)exp(j2nF£>it)  +  n{u,t)  (22) 

i=i 

where  a(u)  stands  for  radar  cross  section  (RCS)  and  can  be  modeled  using  non-zero  constants  for 
nonfluctuating  target  and  four  Swerling  target  models  for  fluctuating  target[15];  Fqx  is  the  doppler 
shift  of  target  relative  to  waveform  i;  t{  is  delay  of  waveform  i,  and  n(u,  t )  is  additive  white  Gaussian 
noise  (AWGN).  In  this  paper,  we  propose  a  RAKE  structure  for  waveform  diversity  combining,  as 
illustrated  by  Fig.  1. 
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According  to  this  structure,  the  received  ri(u,t)  is  processed  by  a  bank  of  matched  filters,  then 
the  output  of  branch  1  (after  integration)  is 

\Zi{u;ti,  ■  ■  ■  ,FDm)  | 

fT/2 

=  J  T^r1(u,t)x*1(t-t1)ds  (23) 

rT/2  M 

=  I  /  Ea*(u)a;*(f~<i)expO'27rFDit)  +  n(u,t)]x*(t-t1)dt|  (24) 

>/-r/2  i=1 

where  /5jy2  n(u,  t)xl(t  -  t\)dt  can  easily  be  proved  to  be  AWGN,  so 

A  f T 

KMl)|  =  /  n(u,t)x\(t  -  ti)dt  (25) 

J-T/2 

follows  Rayleigh  distribution.  Assuming  ti  =  t2  =  •  •  •  =  tM  =  r,  then  based  on  (18), 

M 

\Zi(u-,t,FDi,--- ,FDm)\  «  y; |a(tt)Fsinc[7r(f  -  1  +  .FP,T’)1I 

i=2 

a(u)E  sinbrFp,  (T  —  |r|)l 

+  - - —  +K«.t)I  (26) 

Similarly,  we  can  get  the  output  for  any  branch  m  (m  =  1, 2,  •  •  •  ,  M), 


\Zm(u;  r,  F&1 ,  •  •  •  ,FDm)\ 


X]  |a(u)Fsinc[ir(z -m  +  Fc.T)] | 
a(u)Fsin[7rFr,m(T-  |r|)J  ,  ,  ,  s, 

SftT -  +W“'T)I 


So  | (u,  r,  Fdj  ,  ■  •  ■  ,  Fqm  )  |  consists  of  three  parts,  signal  (reflected  signal  from  radar  m  waveform) : 
q(u)£  sin[7rFD^  (T— |r|)l  .  ,  r  r 

t-kFdZ  >  interferences  from  other  waveforms:  |a(u)Fsinc[7r(i  ~m  +  FDiT)]\, 

and  noise:  \n(u,  r)|. 

We  can  also  have  three  special  cases  for  \Zm(u-,T,  FDl,  ■  ■■  ,FD%  )|: 


1.  When  FDl  =  •  •  •  =  FDm  =  0, 


|2’m(u;r,0,0," 


Fa(u)(T  —  |r  |)] 

- j, -  +  Hu,t)  | 
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which  means  if  there  is  no  doppler  mismatch,  there  will  be  no  interference  from  other  wave¬ 


form. 


2.  If  t  =  0,  then  (27)  becomes 


I Zm{u\  0,  FDl ,  •  •  ■  ,  FDm)\ 

M 

^  |a(u).Esinc[7r(i  —  m  +  + 

1=1,7^771 


a(u)Esin[nFDmT} 


TnFi 


D„ 


+  |n(u)|  (29) 


3.  If  t  =  0,  and  Fd1  =  ■  ■  •  —  Fqm  —  0,  then  (27)  becomes 


\Zm{u;  0, 0,0,  •  •  •  ,0)|  w  | Ea(u)\  +  |n(u)| 


(30) 


Doppler  mismatch  happens  quite  often  in  target  search  where  target  velocity  is  not  yet  known. 
However,  in  target  recognition,  generally  high-resolution  measurements  of  targets  in  range 
(r  =  0)  and  doppler  are  available,  so  (30)  will  be  used  for  automatic  target  recognition. 

How  to  combine  all  the  Zm's  (m  =  1,2,-  ••  ,  M)  are  very  similar  to  the  diversity  combining 
in  communations  to  combat  channel  fading,  and  the  combination  schemes  may  be  different  for 
different  applications.  In  this  paper,  we  are  interested  in  applying  RSN  waveform  diversity  to 
automatic  target  recognition  (ATR),  e.g.,  recognition  that  the  echo  on  a  radar  display  is  that  of  an 
aircraft,  ship,  motor  vehicle,  bird,  person,  rain,  chaff,  clear-air  turbulence,  land  clutter,  sea  clutter, 
bare  montains,  forested  areas,  meteors,  aurora,  ionized  media,  or  other  natural  phenomena.  Early 
radars  were  “blob”  detectors  in  that  they  detected  the  presence  of  a  target  and  gave  its  location  in 
range  and  angle,  and  radar  began  to  be  more  than  a  blob  detector  and  could  provide  recognition 
of  one  type  of  target  from  another [17].  It  is  known  that  small  changes  in  the  aspect  angle  of 
complex  (multiple  scatter)  targets  can  cause  major  changes  in  the  radar  cross  section  (RCS).  This 
has  been  considered  in  the  past  as  a  means  of  target  recognition,  and  is  called  fluctuation  of  radar 
cross  section  with  aspect  angle,  but  it  has  not  had  much  success[17].  In  this  paper,  we  propose 
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a  maximum  likelihood  automatic  target  recognition  (ML-ATR)  algorithm  for  RSN.  We  will  study 
non-fluctuating  target  as  well  as  fluctuating  target. 


4.1  ML-ATR  for  Non-fluctuating  Targets 


In  some  sources,  the  non-fluctuating  target  is  identified  as  “Swerling  0”  or  “Swerling  5”  model[18]. 
For  non- fluctuating  target,  the  RCS  am(u )  is  just  a  constant  a  for  a  given  target.  In  (30), 
\n{u,  t)\  follows  Rayleigh  distribution  since  n(u,  r)  is  a  Gaussian  random  variable  for  given  r, 
so  \Zm(u;  0, 0,  •  •  •  ,0)|  follows  Rician  distribution  because  signal  E\a  is  a  constant.  Let  ym  = 
| Zm(u\  0, 0,  •  •  •  ,0)|,  then  the  probability  density  function  (pdf)  of  ym  is 


a  \  ,  (ym  T  A  )  i  .  2\ym 

f(Vm)  =  — 2- exp h^-2 70  — 2“ 

<TZ  G* 


(31) 


where 

A  =  E\a\,  (32) 

a2  is  the  noise  power  (with  I  and  Q  sub-channel  power  cr2/ 2),  and  Iq(-)  is  the  zero-order  modified 
Bessel  function  of  the  first  kind.  Let  y  =  [j/i,  J/2>  •  •  •  ,  J/m],  then  the  pdf  of  y  is 

M 

/( y)  -  n  /(y«)  (33) 

771=1 

Our  ATR  is  a  multiple-category  hypothesis  testing  problem,  i.e.,  to  decide  a  target  category 
(e.g.  aircraft,  ship,  motor  vehicle,  bird,  etc)  based  on  ri(u,t).  Assume  there  are  totally  N  categories 
and  category  n  target  has  RCS  an,  so  the  ML-ATR  algorithm  to  decide  a  target  category  C  can 
be  expressed  as, 


C  =  argmax()r_1/(y|A  =  E\an\) 


M 

N  TT  ^ 

=  argmaxn=1  II 

XX  a 

m— 1 


r  (.Vm  ^0iV)  1  t  /2Fl|an|ym. 

2  eXP^  „2  _2  ) 


<7 


(34) 

(35) 
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4.2  ML-ATR  for  Fluctuating  Targets 

Fluctuating  target  modeling  is  more  realistic  in  which  the  target  RCS  is  drawn  from  either  the 
Rayleigh  or  chi-square  of  degree  four  pdf.  The  Rayleigh  model  describes  the  behavior  of  a  complex 
target  consisting  of  many  scatters,  none  of  which  is  dominant.  The  fourth-degree  chi-square  models 
targets  having  many  scatters  of  similar  strength  with  one  dominant  scatter.  Based  on  different 
combinations  of  pdf  and  decorrelation  characteristics  (scan-to-scan  or  pulse-to-pulse  decorrelation), 
four  Swerling  models  are  used[15J.  In  this  paper,  we  will  focus  on  “Swerling  2”  model  which  is 
Rayleigh  distribution  with  pulse-to-pulse  decorrelation.  The  pulse-to-pulse  decorrelation  implies 
that  each  individual  pulse  results  in  an  independent  value  for  RCS  a. 

For  Swerling  2  model,  the  RCS  |a(u)|  follows  Rayleigh  distribution  and  its  I  and  Q  subchannels 
follow  zero-mean  Gaussian  distributions  with  variance  q2.  Assume 


a(u)  =  ac[(u)  +  jotQ{u) 


and  n{u)  —  n/(u)  +  jriQ(u)  follows  zero-mean  complex  Gausian  distribution  with  variance  a 2  for 
the  I  and  Q  subchannels.  According  to  (24),  (27),  and  (30), 


I Zm{u-,  0, 0, 0,  •  •  •  ,  0)|  «  | Ea{u)  +  n(u)| 


is  a  more  accuate  approximation.  Since  a(u)  and  n(u)  are  zero-mean  complex  Gaussian  random 
variables,  so  Ea(u)  +  n(u)  is  a  zero-mean  Gaussian  random  variable  with  variance  E2y2  +  a2  for 
the  I  and  Q  subchannels,  which  means  ym  =  |Zm(u;0,0,  •  •  •  ,  0) |  follows  Rayleigh  distribution  with 
parameter  y/ E2y2  +  a2, 


£2q2  +  cr2  6XP^  E2^+  a2  ^ 


The  mean  value  of  ym  is  y/-{E  and  variance  is  EiL^f+^1 .  The  variance  of  signal 


~4  n\E  7  and  the  variance  of  noise  is  —  ^2. 


12 


(39) 


Let  y  =  [2/1, 2/2,  ■  ■  ■  ,2/m],  then  the  pdf  of  y  is 

M 

/( y)  =  II  /W 

771=1 

Assume  there  are  totally  N  categories  and  category  n  target  has  RCS  an(u)  (with  variance  72),  so 
the  ML-ATR  algorithm  to  decide  a  target  category  C  can  be  expressed  as, 


C  =  argmax"=1/(y|7  =  7„) 
M 


N 

arg  maxn=1 


n 


2/m 


ij  E2 72  +  <r 


exp(- 


2/m 


E2H  +  cr2 


) 


(40) 

(41) 


5  Simulations 

Radar  sensor  networks  will  be  required  to  detect  a  broad  range  of  target  classes.  Too  often,  the 
characteristics  of  objects  that  are  not  of  interest  (e.g.,  bird)  will  be  similar  to  those  of  threat  objects 
(e.g.,  missile).  Therefore,  new  techniques  to  discriminate  threat  against  undesired  detections  (e.g. 
birds,  etc.)  are  needed.  We  applied  our  ML-ATR  to  this  important  application,  to  recognize  a 
target  from  many  target  classes.  We  assume  that  the  domain  of  target  classes  is  known  a  priori  (N 
in  Sections  4.1  and  4.2),  and  that  the  RSN  is  confined  to  work  only  on  the  known  domain. 

For  non-fluctuating  target  recognition,  our  targets  have  5  classes  with  different  RCS  values, 
which  are  summarized  in  Table  1  [IT] .  We  applied  the  ML-ATR  algorithms  in  Section  4.1  (for 
nonfluctuating  target  case)  to  classify  an  unknown  target  as  one  of  these  5  target  classes.  At  each 
average  SNR  value,  we  ran  Monte-Carlo  simulations  for  105  times  for  each  target.  The  average 
SNR  value  is  based  on  the  average  power  from  all  targets  (signal  variance),  so  the  actual  SNRs 
for  bird  and  missile  are  much  lower  than  the  average  SNR  value.  For  example,  at  the  average 
SNR=16dB,  the  bird  target  SNR— -33.1646dB,  and  missile  target  SNR=0.8149dB;  and  at  average 
SNR=20dB,  the  bird  target  SNR=-29.1646dB,  and  missile  target  SNR=4.8149dB.  In  Fig.  2(a) (b), 
we  plotted  the  probability  of  ATR  error  in  bird  and  missile  recognition  when  they  are  assumed 
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as  nonfluctuating  targets.  Observe  both  figures,  single  radar  system  can’t  perform  well  in  both 


recognitions,  and  their  probability  of  ATR  error  is  above  10%,  which  can’t  be  used  for  real-world 
ATR.  However,  the  5-radar  RSN  and  10-radar  RSN  can  maintain  very  low  ATR  errors.  In  Fig. 
2(c),  we  plotted  the  average  probability  of  ATR  error  for  all  5  targets  recognition.  Since  the  other 
3  targets  (different  aircrafts)  have  much  higher  SNRs,  so  their  ATR  error  is  lower,  which  makes  the 
average  probability  of  ATR  error  lower. 

For  fluctuating  target  recognition,  we  assume  the  fluctuating  targets  follow  “Swerling  2”  model 
(Rayleigh  with  pulse-to-pulse  decorrelation),  and  assume  the  RCS  value  listed  in  Table  1  to  be 
the  standard  deviation  (std)  7n  of  RCS  an(u)  for  target  n.  We  applied  the  ML- ATR  algorithm 
in  Section  4.2  (for  fluctuating  target  case)  for  target  recognition  within  the  5  targets  domain. 
Similarly  we  ran  Monte-Carlo  simulations  at  each  SNR  value.  In  Fig.  3(a)  (b)(c),  we  plot  the  ATR 
performance  for  fluctuating  targets  and  compared  the  performances  of  single  radar  system,  5-radar 
RSN,  and  10-radar  RSN.  Observe  that  the  two  RSNs  perform  much  better  than  the  single  radar 
system.  The  ATR  error  for  missile  is  higher  than  that  of  bird  because  Rayleigh  distribution  of 
missile  has  lots  of  overlap  with  its  neighbor  targets  (aircrafts).  Comparing  Fig.  2(a)(b)(c)  to  Fig. 
3(a)  (b)(c),  it  is  clear  that  higher  SNRs  are  needed  for  fluctuating  target  recognition  comparing  to 
nonfluctuating  target  recognition.  According  to  Skolnik[17],  radar  performance  with  probability  of 
recognition  error  ( pe )  less  than  10%  is  good  enough.  Our  RSN  with  waveform-diversity  can  have 
probability  of  ATR  error  much  less  than  10%  for  each  target  ATR  as  well  as  the  average  ATR  for 
all  targets.  However,  the  single  radar  system  has  probability  of  ATR  error  much  higher  than  10%. 
Observe  Fig.  3(c),  the  average  probability  of  ATR  error  of  single-radar  is  impossible  to  be  less  than 
10%  even  at  extreme  high  SNR.  Our  RSN  with  waveform  diversity  is  very  promising  to  be  used  for 
real-world  ATR. 
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6  Conclusions  and  Future  Works 


We  have  studied  constant  frequency  pulse  waveform  design  and  diversity  in  radar  sensor  networks. 
We  showed  that  the  waveforms  can  co-exist  if  the  carrier  frequency  spacing  is  1/T  between  two 
radar  waveforms.  We  made  analysis  on  interferences  among  waveforms  in  RSN  and  proposed  a 
RAKE  structure  for  waveform  diversity  combining  in  RSN.  As  an  application  example,  we  applied 
the  waveform  design  and  diversity  to  automatic  target  recognition  (ATR)  in  RSN  and  proposed 
maximum-likehood  (ML)-ATR  algorithms  for  nonfluctuating  target  as  well  as  fluctuating  target. 
Simulation  results  show  that  RSN  using  our  waveform  diversity-based  ML- ATR  algorithm  performs 
much  better  than  single  radar  system  for  nonfluctuating  targets  and  fluctuating  targets  recognition. 

In  our  future  works,  we  will  investigate  the  ATR  when  multiple  targets  co-exist  in  RSN,  and 
the  number  of  targets  are  time- varying.  In  this  paper,  we  used  spatial  diversity  combining.  For 
multi-target  ATR,  we  will  further  investigate  spatial-temporal-frequency  combining  for  waveform 
diversity  in  RSN. 
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Index  n 

Target 

RCS 

1 

Bird 

2 

Conventional  unmanned  winged  missile 

Mill! 

3 

Small  single-engine  aircraft 

i 

4 

Small  flighter  aircraft  or  4  passenger  jet 

5 

Large  flighter  aircraft 

6 
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Figure  1:  Waveform  diversity  combining  by  clusterhead  in  RSN. 
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(a) 


Figure  2:  Probability  of  ATR  error  for  nonfluctuating  targets  at  different  average  SNR  (dB)  values, 
(a)  bird,  (b)  missile,  (c)  the  average  probability  of  ATR  error  for  5  targets. 
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(b) 


(c) 


Figure  3:  Probability  of  ATR  error  for  fluctuating  targets  at  different  average  SNR  (dB)  values, 
(a)  bird,  (b)  missile,  (c)  the  average  probability  of  ATR  error  for  5  targets. 
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Abstract — In  this  paper,  we  perform  some  theoretical 
studies  on  constant  frequency  (CF)  pulse  waveform  design 
and  diversity  in  radar  sensor  networks  (RSN):  (1)  the 
conditions  for  waveform  co-existence,  (2)  interferences  among 
waveforms  in  RSN,  (3)  waveform  diversity  combining  in  RSN. 
As  an  application  example,  we  apply  the  waveform  design 
and  diversity  to  automatic  target  recognition  (ATR)  in  RSN 
and  propose  maximum-likehood  (ML)-ATR  algorithms  for 
nonfluctuating  target  as  well  as  fluctuating  target.  Simulation 
results  show  that  our  waveform  diversity-based  ML-ATR 
algorithm  performs  much  better  than  single-waveform  ML- 
ATR  algorithm  for  nonfluctuating  targets  or  fluctuating 
targets.  Conclusions  arc  drawn  based  on  our  analysis  and 
simulations. 

I.  Introduction  and  Motivation 

The  network  of  radar  sensors  should  operate  with  mul¬ 
tiple  goals  managed  by  an  intelligent  platform  network 
that  can  manage  the  dynamics  of  each  radar  to  meet  the 
common  goals  of  the  platform,  rather  than  each  radar 
to  operate  as  an  independent  system.  Therefore,  it  is 
significant  to  perform  signal  design  and  processing  and 
networking  cooperatively  within  and  between  platforms  of 
radar  sensors  and  their  communication  modules.  Wave¬ 
form  diversity  is  the  technology  that  will  allow  one  or 
more  sensors  on  board  a  platform  to  automatically  change 
operating  parameters,  e.g.,  frequency,  gain  pattern,  and 
pulse  repetition  frequency  (PRF)  to  meet  the  varying  en¬ 
vironments.  It  has  long  been  recognized  that  judicious  use 
of  properly  designed  waveforms,  coupled  with  advanced 
receiver  strategies,  is  fundamental  to  fully  utilizing  the 
capacity  of  the  electromagnetic  spectrum.  However,  it  is 
only  relatively  recent  advances  in  hardware  technology  that 
are  enabling  a  much  wider  range  of  design  freedoms  to  be 
explored.  As  a  result,  there  are  emerging  and  compelling 
changes  in  system  requirements  such  as  more  efficient 
spectrum  usage,  higher  sensitivities,  greater  information 


content,  improved  robustness  to  errors,  reduced  interfer¬ 
ence  emissions,  etc.  The  combination  of  these  is  fuelling  a 
worldwide  interest  in  the  subject  of  waveform  design  and 
the  use  of  waveform  diversity  techniques. 

In  the  existing  works  on  waveform  design  and  selection, 
Fitzgerald  [5]  demonstrated  the  inappropriateness  of  selec¬ 
tion  of  waveform  based  on  measurement  quality  alone: 
the  interaction  between  the  measurement  and  the  track 
can  be  indirect,  but  must  be  accounted  for.  Bell  [3]  used 
information  theory  to  design  radar  waveform  for  the  mea¬ 
surement  of  extended  radar  targets  exhibiting  resonance 
phenomena.  In  [2],  singularity  expansion  method  was  used 
to  design  some  discriminant  waveforms  such  as  K-pulse, 
E-pulse,  and  S-pulse.  Sowelam  and  Tewfik  [13]  developed 
a  signal  selection  strategy  for  radar  target  classification, 
and  a  sequential  classification  procedure  was  proposed  to 
minimize  the  average  number  of  necessary  signal  trans¬ 
missions.  Intelligent  waveform  selection  was  studied  in 
[1][6],  but  the  effect  of  doppler  shift  was  not  considered. 
In  [7],  the  performance  of  constant  frequency  (CF)  and 
linear  frequency  modulated  (LFM)  waveform  fusion  from 
the  standpoint  of  the  whole  system  was  studied,  but  the 
effects  of  clutter  was  not  considered.  In  [12],  CF  and 
LFM  waveforms  were  studied  for  sonar  system,  but  it  was 
assumed  that  the  sensor  is  nonintelligent  (i.e.,  waveform 
can’t  be  selected  adaptively).  All  the  above  studies  and 
design  methods  were  focused  on  the  waveform  design  or 
selection  for  a  single  active  radar  or  sensor.  In  [10],  cross¬ 
correlation  properties  of  two  radars  are  briefly  mentioned 
and  the  binary  coded  pulses  using  simulated  annealing 
[4]  are  highlighted.  However,  the  cross-correlation  of  two 
binary  sequences  such  as  binary  coded  pulses  (e.g.  Barker 
sequence)  are  much  easier  to  study  than  that  of  two  analog 
radar  waveforms.  In  this  paper,  we  will  focus  on  the 
waveform  diversity  and  design  for  radar  sensor  networks 
using  constant  frequency  (CF)  pulse  waveform. 


The  rest  of  this  paper  is  organized  as  follows.  In  Section 
II,  we  study  the  co-existence  of  radar  waveforms.  In 
Section  III,  we  analyze  the  interferences  among  radar 
waveforms.  In  Section  IV  we  propose  a  RAKE  structure 
for  waveform  diversity  combining  and  propose  maximum- 
likelihood  (ML)  algorithms  for  automatic  target  recogni¬ 
tion  (ATR).  In  Section  V,  we  provide  simulation  results 
on  ML-ATR.  In  Section  VI,  we  conclude  this  paper  and 
provide  some  future  works. 


II.  Co-existence  of  Radar  Waveforms 


In  radar  sensor  networks  (RSN),  radar  sensors  will 
interfere  with  each  other  and  the  signal-to-interference- 
ratio  may  be  very  low  if  the  waveforms  are  not  properly 
designed.  We  will  introduce  orthogonality  as  one  criterion 
for  waveforms  design  in  RSN  to  make  them  co-existence. 
Besides,  the  radar  channel  is  narrow-band,  so  we  will  also 
consider  the  bandwidth  constraint. 

In  our  radar  sensor  networks,  we  choose  CF  pulse 
waveform.  The  CF  pulse  waveform  can  be  defined  as 


x(t)  =  ^|  exp(j27r/3£)  -  T/2  <  t  <  T/2  (1) 

In  radar,  ambiguity  function  (AF)  is  an  analytical  tool  for 
waveform  design  and  analysis  that  succinctly  characterizes 
the  behavior  of  a  waveform  paired  with  its  matched  filter. 
The  ambiguity  function  is  useful  for  examing  resolution, 
side  lobe  behavior,  and  ambiguities  in  both  range  and 
Doppler  for  a  given  waveform[9].  For  a  single  radar, 
the  matched  filter  for  waveform  x(t)  is  x*(—t),  and  the 
ambiguity  function  of  CF  pulse  waveform  is 


A{t,Fd)  = 


,T/2 

/  x(t)  exp  {j2irFDs)x*{t  —  r)dt 

J-T/2+t 

Esin[*FD(T-\Tun  _T<r<T(2) 


TnFn 


However,  the  above  ambiguity  is  for  one  radar  only  (no 
co-existing  radar). 

For  radar  sensor  networks,  the  waveforms  from  differ¬ 
ent  radars  will  interfere  with  each  other.  We  choose  the 
waveform  for  radar  t  as 


xi(t)  =  exp[j27r(/?  +  5i)t\  -  T/2  <  t  <  T/2  (3) 

which  means  there  is  a  frequency  shift  5 *  for  radar  i.  To 
minimize  the  interference  from  one  waveform  to  the  other, 
optimal  values  for  Si  should  be  determined  to  have  the 
waveforms  orthogonal  to  each  other,  i.e.,  let  the  cross¬ 


correlation  between  Xi(t)  and  xn(t)  be  0, 

fT/2 

/  Xi(t)x*n{t)dt 
J  —T/2 

E  f T /2 

=  —  /  exp\j2ix{P  +  <5j)f]  exp[— j'27r(/3  +  5n)t}dt 

*  J-T/2 

=  £sinc[7r(<5i  -  5n)T]  (4) 

If  we  choose 

Si  =  £  (5) 

where  i  is  a  dummy  index,  then  (4)  can  have  two  cases 

/■T/2  (E  i=n 

/  Xi(t)x*n(t)dt=\  (6) 

J-T/2  [o  irn 

So  choosing  Si  =  ^  in  (3)  can  have  orthogonal  waveforms, 
i.e.,  the  waveforms  can  co-exist  if  the  carrier  spacing 
is  1/T  between  two  radar  waveforms,  i.e.,  orthogonality 
amongst  carriers  can  be  achieved  by  separating  the  carriers 
by  an  interger  multiple  of  the  inverse  of  waveform  pulse 
duration.  With  this  design,  all  the  orthogonal  waveforms 
can  work  simultaneously.  However,  there  may  exist  time 
delay  and  doppler  shift  ambiguity  which  will  have  inter¬ 
ferences  to  other  waveforms  in  RSN. 


III.  Interferences  of  Waveforms  In  Radar 
Sensor  Networks 
A.  RSN  with  Two  Radar  Sensors 

We  are  interested  in  analyzing  the  interference  from  one 
radar  to  another  if  there  exist  time  delay  and  doppler  shift. 
For  a  simple  case  where  there  are  two  radar  sensors  ( i 
and  n),  the  ambiguity  function  of  radar  i  (considering 
interference  from  radar  n)  is 


Ai  ( ti ,  tn ,  Fdx  ,  Fon ) 


(7) 


/OO 

[xi(t)  exp{j27rFDit)  +  xn(t  -  tn)  exp{j2-KFDnt)\ 

•OO 


x*(t  -  U)dt\ 

T /2+min(t  i,tn) 


(8) 
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/ 


— T/2+max(ti  ,tn ) 
T/2 


Xn{t  -  tn)  exp(j2irFDri  t)x\ (t  -  ti)dt 


f  ' 

/  Xi(t)  exp  {j2irFDit)x* ( t  - 

J-T/2+U 
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ti)dt 


(9) 


T/2-f  min(ti,£n) 

xn{t  -  tn)  exp(j2irFDnt)xt(t  -  U)dt 

T /2+max(tj,tn) 

Esm[nFDi(T  -  |£i|)] 


TirFoi 

To  make  analysis  easier,  we  assume  ti  —  tn  =  r,  then  (10) 
can  be  simplified  as 


(10) 


Ai  (r,  Foi ,  Fd„  )  ~ 

+ 


ifsinc[7r(n  —  i  +  FdhT)]  | 
Esm[irFDi  (T  —  |r|)] 
TirFoi 


(11) 
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B.  RSN  with  M  Radar  Sensors 


of  branch  1  (after  integration)  is 


It  can  be  extended  to  an  RSN  with  M  radars.  Assuming 
time  delay  r  for  each  radar  is  the  same,  then  the  ambiguity 
function  of  radar  1  (considering  interferences  from  all  the 
other  M  —  1  radars  with  CF  pulse  waveforms)  can  be 
expressed  as 


M  (T i  Fds  ,  •  •  •  ,Fl 


«  ^|£sinc[7r(i-l  +  FD,T)]| 

+  (12) 

1  nBDl 


IV.  Waveform  Diversity  and  Combining  with 
Application  to  Automatic  Target  Recognition 

In  RSN,  The  radar  sensors  are  networked  together  in  an 
ad  hoc  fashion.  Scalability  concern  suggest  a  hierarchical 
organization  of  radar  sensor  networks  with  the  lowest  level 
in  the  hierarchy  being  a  cluster.  In  RSN,  each  radar  can 
provide  their  waveform  parameters  such  as  <5,  to  their 
clusterhead  radar,  and  the  clusterhead  radar  can  combine 
the  waveforms  from  its  cluster  members.  In  RSN  with 
M  radars,  the  received  signal  for  clusterhead  (assume  it’s 
radar  1)  is 

M 

n(u,t)  =  'Y^a(u)xi(t  -  ti)  exp(j2nFDit)  +  n(u,t) 

i—1 

(13) 

where  a(u)  stands  for  radar  cross  section  (RCS)  and  can  be 
modeled  using  non-zero  constants  for  nonfluctuating  target 
and  four  Swerling  target  models  for  fluctuating  target[9]; 
Foi  is  the  doppler  shift  of  target  relative  to  waveform  i; 
ti  is  delay  of  waveform  i,  and  n{u,t )  is  additive  white 
Gaussian  noise  (AWGN).  In  this  paper,  we  propose  a 
RAKE  structure  for  waveform  diversity  combining,  as 
illustrated  by  Fig.  1 . 


| — - *\  H)dt 


x'2(t-t2) 


Fig.  1.  Waveform  diversity  combining  by  clusterhead  in  RSN. 


|^1  (ti,  ^1 ,  •  •  •  ?  Fdm  )  | 

I  rT/ 2  I 


f  ' 

=  /  ri{u,t)x\(t  —  ti)ds  \ 

\J-T/2 

/T/2  M 

[£  0ii(u)xi(t  -  ti)  exp(j2nFDit) 

■T/2 


J~Tr>  i= i 

+n(u,t)]xl(t  —  ti)dt\  (15) 

where  n(u> t)x\(t  —  ti)dt  can  easily  be  proved  to 
be  AWGN,  so 

A  fT/2 

|n(u,ii)|  =  /  n(u,t)xl(t  —  ti)dt  (16) 

J-T/2 


follows  Rayleigh  distribution.  Assuming  fi  =  f2  =  ■  •  ■  = 
tM  =  t,  then  based  on  (12), 

■  •  ,FDm) | 

M 

«  £  |q(?j)ffsinc[7r(i  -  1  +  FDiT)]\ 

+  jgi-“^$MA%(LrP|)1l  +  [„KT)|  (,7) 

Similarly,  we  can  get  the  output  for  any  branch  m  (m  = 

1,2  ,--,M), 

| Zm(u;T,FDi,---  ,FDm) | 

M 

fs  £  \a(u)Esmc[n(i  ~  m  + FotT)]] 

iz=l,i^m 

+  +  |n(u,r)|  (18) 


So  \Zm(u-,  r,  Fd2,  •  ■  ■  ,  Fdm)\  consists  of  three  parts, 
signal  (reflected  signal  from  radar  m  waveform): 

|  a("IEsi;d^Pm(T~M)l  |,  interferences  from  other 

waveforms:  \a{u)Esinc[n(i  -  m  +  FDiT)]\, 

and  noise:  |n(u,r)|. 

We  can  have  three  special  cases  for 
\Zm(u-,T,FDl,-  ■  ■  ,FDm)\: 

1)  When  FDl  =  ■  ■  ■  =  FDm  =  0, 


Zm(u-,T,  0,0,- ,0)| 
Ea(u)(T  —  |t|)] 

T  + 


+  |n(u,r)|  (19) 


According  to  this  structure,  the  received  r  \  ( u ,  t )  is 
processed  by  a  bank  of  matched  filters,  then  the  output 
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which  means  if  there  is  no  doppler  mismatch,  there 
will  be  no  interference  from  other  waveform. 


2)  If  r  =  0,  then  (18)  becomes 


\Zm(u;0,  FDl,  •  •  ■  ,  FDm)\ 


M 


+ 


y;  |a(w)i5sinc[7r(*  -  m  +  F^T)]! 

1=1,1^771 

a{u)E  sin[7r Fp^T] 


T  -kFd„ 


3)  If  r  =  0,  and  FDl  = 
becomes 


+  K«)I  (20) 

FDm  =  0,  then  (18) 


|Zm(u;  0,0,0,  •  ■  •  ,0)|  «  \Ea(u)\  +  |n(w)|  (21) 

Doppler  mismatch  happens  quite  often  in  target 
search  where  target  velocity  is  not  yet  known.  How¬ 
ever,  in  target  recognition,  generally  high-resolution 
measurements  of  targets  in  range  (t  =  0)  and 
doppler  are  available,  so  (21)  will  be  used  for  au¬ 
tomatic  target  recognition. 

How  to  combine  all  the  Zm’s  (m  =  1 , 2,  -  •  ■  ,  M)  are 
very  similar  to  the  diversity  combining  in  communations 
to  combat  channel  fading,  and  the  combination  schemes 
may  be  different  for  different  applications.  In  this  paper, 
we  are  interested  in  applying  RSN  waveform  diversity  to 
automatic  target  recognition  (ATR),  e.g.,  recognition  that 
the  echo  on  a  radar  display  is  that  of  an  aircraft,  ship,  motor 
vehicle,  bird,  etc.  Early  radars  were  “blob”  detectors  in  that 
they  detected  the  presence  of  a  target  and  gave  its  location 
in  range  and  angle,  and  radar  began  to  be  more  than  a 
blob  detector  and  could  provide  recognition  of  one  type 
of  target  from  another[10].  It  is  known  that  small  changes 
in  the  aspect  angle  of  complex  (multiple  scatter)  targets 
can  cause  major  changes  in  the  radar  cross  section  (RCS). 
This  has  been  considered  in  the  past  as  a  means  of  target 
recognition,  and  is  called  fluctuation  of  radar  cross  section 
with  aspect  angle,  but  it  has  not  had  much  success[10].  In 
this  paper,  we  propose  a  maximum  likelihood  automatic 
target  recognition  (ML-ATR)  algorithm  for  RSN.  We  will 
study  non-fluctuating  target  as  well  as  fluctuating  target. 


cr2  is  the  noise  power  (with  I  and  Q  sub-channel  power 
<t2/2),  and  70(-)  is  the  zero-order  modified  Bessel  function 
of  the  first  kind.  Let  y  =  [2/1 ,  y2,  ■  ■  •  ,2/m],  then  the  pdf  of 
y  is 

M 

/( y)  =  n  /(**»)  (24) 

771=1 

Our  ATR  is  a  multiple-category  hypothesis  testing  prob¬ 
lem,  i.e.,  to  decide  a  target  category  (e.g.  aircraft,  ship, 
motor  vehicle,  bird,  etc)  based  on  n(u,t).  Assume  there 
are  totally  N  categories  and  category  n  target  has  RCS 
an,  so  the  ML-ATR  algorithm  to  decide  a  target  category 
C  can  be  expressed  as, 


C  =  argmax„=1/(y|A  =  E\an\) 

M  0 

N  TT  r  ' 

=  argmaxn=1  [[  -^-exp[-- 

771=1 


flm  +  E2al),T  ,2E\an\yn 
- - For 15 — 


B.  ML-ATR  for  Fluctuating  Targets 

Fluctuating  target  modeling  is  more  realistic  in  which 
the  target  RCS  is  drawn  from  either  the  Rayleigh  or 
chi-square  of  degree  four  pdf.  The  Rayleigh  model  de¬ 
scribes  the  behavior  of  a  complex  target  consisting  of 
many  scatters,  none  of  which  is  dominant.  The  fourth- 
degree  chi-square  models  targets  having  many  scatters 
of  similar  strength  with  one  dominant  scatter.  Based  on 
different  combinations  of  pdf  and  decorrelation  character¬ 
istics  (scan-to-scan  or  pulse-to-pulse  decorrelation),  four 
Swerling  models  are  used[9].  In  this  paper,  we  will  focus 
on  “Swerling  2”  model  which  is  Rayleigh  distribution  with 
pulse-to-pulse  decorrelation.  The  pulse-to-pulse  decorre¬ 
lation  implies  that  each  individual  pulse  results  in  an 
independent  value  for  RCS  a. 

For  Swerling  2  model,  the  RCS  |a(u)|  follows  Rayleigh 
distribution  and  its  I  and  Q  subchannels  follow  zero-mean 
Gaussian  distributions  with  variance  j2.  Assume 

q(u)  =  ai{u)  +  jctQ{u)  (25) 


A.  ML-ATR  for  Non -fluctuating  Targets 

In  some  sources,  the  non-fluctuating  target  is  identified 
as  “Swerling  0”  or  “Swerling  5”  modelfll].  For  non¬ 
fluctuating  target,  the  RCS  am(u)  is  just  a  constant  a  for  a 
given  target.  In  (21),  \n(u,  r)|  follows  Rayleigh  distribution 
since  n(u,  r)  is  a  Gaussian  random  variable  for  given  t, 
so  \Zm(u;  0, 0,  •  •  •  ,0)|  follows  Rician  distribution  because 
signal  E\a  is  a  constant.  Let  ym  =  \Zm(u\  0, 0,  •  ■  •  ,0)|, 
then  the  probability  density  function  (pdf)  of  ym  is 


/(2/m) 


22/m _ f  (Vm  +  ^2)  i  T  I  2^2/m 

-5-exp[ - - ]Io(— j~)  (22) 


where 

A  =  £|a|,  (23) 


and  n(u)  —  ni(u)  +  jriQ(u)  follows  zero-mean  complex 
Gausian  distribution  with  variance  a2  for  the  I  and  Q 
subchannels.  According  to  (15),  (18),  and  (21), 


|Zm(u;0,0,0,--  -  ,0)|  »  \Ea(u)+n(u)\  (26) 


is  a  more  accuate  approximation.  Since  a(u)  and  n(u) 
are  zero-mean  complex  Gaussian  random  variables,  so 
Ea(u)  +  n(u)  is  a  zero-mean  Gaussian  random  variable 
with  variance  E2 -y2  +  a2  for  the  I  and  Q  subchannels, 
which  means  ym  =  \Zrn(u;  0, 0,  ■  •  ■  ,  0) |  follows  Rayleigh 
distribution  with  parameter  y/E2 j2  +  a 2, 


/  (2/m)  — 


Vm 


E2 72  +  cr2 


exp(- 


E2 72  +  a2 


(27) 
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The  mean  value  of  ym  is  y  and  variance  is 

(4-7r)(g^  7_+g  ).  j^g  varjance  0f  sjgnal  is  7  ancj 

the  variance  of  noise  is  (4~^)<J .  _ 

Let  y  =  [j/i,2/2,  •  • '  i  yw].  then  the  pdf  of  y  is 


M 

/( y)  =  IJ  /(ym)  (28) 

m=l 


Assume  there  are  totally  N  categories  and  category  n 
target  has  RCS  an(u)  (with  variance  7^),  so  the  ML-ATR 
algorithm  to  decide  a  target  category  C  can  be  expressed 
as, 


argmax^=1/(y|7  =  7„) 
M 

Vm 


(29) 


n  "i  r  ym-  /  y  777, 

=  *tgn“x"-  n  wwi  exp(-E^rr 


V.  Simulations 

Radar  sensor  networks  will  be  required  to  detect  a 
broad  range  of  target  classes.  Too  often,  the  characteristics 
of  objects  that  are  not  of  interest  (e.g.,  bird)  will  be 
similar  to  those  of  threat  objects  (e.g.,  missile).  Therefore, 
new  techniques  to  discriminate  threat  against  undesired 
detections  (e.g.  birds,  etc.)  are  needed.  We  applied  our  ML- 
ATR  to  this  important  application,  to  recognize  a  target 
from  many  target  classes.  We  assume  that  the  domain  of 
target  classes  is  known  a  priori  (Ar  in  Sections  IV-A  and 
IV-B),  and  that  the  RSN  is  confined  to  work  only  on  the 
known  domain. 


TABLE  I 

RCS  VALUES  AT  MICROWAVE  FREQUENCY  FOR  5  TARGETS. 


Index  n 

Target 

RCS  (rri?) 

1 

Bird 

0.01 

2 

Conventional  unmanned  winged  missile 

0.5 

3 

Small  single-engine  aircraft 

1 

4 

Small  flighter  aircraft  or  4  passenger  jet 

2 

5 

Large  flighter  aircraft 

6 

For  non-fluctuating  target  recognition,  our  targets  have 
5  classes  with  different  RCS  values,  which  are  summa¬ 
rized  in  Table  I[  1 0].  For  fluctuating  target  recognition,  we 
assume  the  fluctuating  targets  follow  “Swerling  2”  model 
(Rayleigh  with  pulse-to-pulse  decorrelation),  and  assume 
the  RCS  value  listed  in  Table  I  to  be  the  standard  deviation 
(std)  7„  of  RCS  an(u)  for  target  n.  We  applied  the 
ML-ATR  algorithms  in  Section  IV-A  (for  nonfluctuating 
target  case)  and  Section  IV-B  (for  fluctuating  target  case) 
to  classify  an  unknown  target  as  one  of  these  5  target 
classes.  At  each  average  SNR  value,  we  ran  Monte-Carlo 
simulations  for  105  times  for  each  target.  The  average 
SNR  value  is  based  on  the  average  power  from  all  tar¬ 
gets  (signal  variance),  so  the  actual  SNRs  for  bird  and 


(a) 


(b) 


(c) 


Fig.  2.  Probability  of  ATR  error  for  nonfluctuating  targets  at  different 
average  SNR  (dB)  values,  (a)  bird,  (b)  missile,  (c)  the  average  probability 
of  ATR  error  for  5  targets. 


missile  are  much  lower  than  the  average  SNR  value.  For 
example,  at  the  average  SNR=16dB,  the  bird  target  SNR=- 
33.1646dB,  and  missile  target  SNR=0.8149dB;  and  at 
average  SNR=20dB,  the  bird  target  SNR=-29.1646dB,  and 
missile  target  SNR=4.8149dB.  In  Fig.  2(a)(b),  we  plotted 
the  probability  of  ATR  error  in  bird  and  missile  recognition 
when  they  are  assumed  as  nonfluctuating  targets.  Observe 
both  figures,  single  radar  system  can’t  perform  well  in 
both  recognitions,  and  their  probability  of  ATR  error  is 
above  10%,  which  can’t  be  used  for  real-world  ATR. 
However,  the  5-radar  RSN  and  10-radar  RSN  can  maintain 
very  low  ATR  errors.  In  Fig.  2(c),  we  plotted  the  average 
probability  of  ATR  error  for  all  5  targets  recognition. 
Since  the  other  3  targets  (different  aircrafts)  have  much 
higher  SNRs,  so  their  ATR  error  is  lower,  which  makes  the 
average  probability  of  ATR  error  lower.  Similarly,  we  plot 
the  performance  in  fluctuating  target  recognition  in  Fig. 
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(a) 


(b) 


(c) 


Fig.  3.  Probability  of  ATR  error  for  fluctuating  targets  at  different 
average  SNR  (dB)  values,  (a)  bird,  (b)  missile,  (c)  the  average  probability 
of  ATR  error  for  5  targets. 

3(a)b)(c).  Comparing  Fig.  2(a)(b)(c)  to  Fig.  3(a)(b)(c),  it 
is  clear  that  higher  SNRs  are  needed  for  fluctuating  target 
recognition  comparing  to  nonfluctuating  target  recogni¬ 
tion.  According  to  Skolnik[10],  radar  performance  with 
probability  of  recognition  error  (pe)  less  than  10%  is 
good  enough.  Our  RSN  with  waveform-diversity  can  have 
probability  of  ATR  error  much  less  than  10%  for  each 
target  ATR  as  well  as  the  average  ATR  for  all  targets. 
However,  the  single  radar  system  has  probability  of  ATR 
error  much  higher  than  10%.  Observe  Fig.  3(c),  the  average 
probability  of  ATR  error  of  single-radar  is  impossible  to 
be  less  than  10%  even  at  extreme  high  SNR.  Our  RSN 
with  waveform  diversity  is  very  promising  to  be  used  for 
real-world  ATR. 

VI.  Conclusions 

We  have  studied  constant  frequency  (CF)  pulse  wave¬ 
form  design  and  diversity  in  radar  sensor  networks  (RSN). 


We  showed  that  the  waveforms  can  co-exist  if  the  carrier 
frequency  spacing  is  1/T  between  two  radar  waveforms. 
We  made  analysis  on  interferences  among  waveforms 
in  RSN  and  proposed  a  RAKE  structure  for  waveform 
diversity  combining  in  RSN.  As  an  application  example, 
we  applied  the  waveform  design  and  diversity  to  automatic 
target  recognition  (ATR)  in  RSN  and  proposed  maximum- 
likehood  (ML)-ATR  algorithms  for  nonfluctuating  target 
as  well  as  fluctuating  target.  Simulation  results  show  that 
RSN  using  our  waveform  diversity-based  ML-ATR  algo¬ 
rithm  performs  much  better  than  single  radar  system  for 
nonfluctuating  targets  and  fluctuating  targets  recognition. 
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Abstract 

Automatic  target  recognition  (ATR)  in  target  search  phase  is  very  challenging  because  the 
target  range  and  mobility  are  not  yet  perfectly  known,  which  results  in  delay-doppler  uncer¬ 
tainty.  In  this  paper,  we  firstly  perform  some  theoretical  studies  on  radar  sensor  network  (RSN) 
design  based  linear  frequency  modulation  (LFM)  waveform:  (1)  the  conditions  for  waveform 
co-existence,  (2)  interferences  among  waveforms  in  RSN,  (3)  waveform  diversity  in  RSN.  Then 
we  apply  RSN  to  ATR  with  delay-doppler  uncertainty  and  propose  maximum-likekihood  (ML) 

ATR  algorithms  for  fluctuating  target  and  nonfluctuating  target.  Simulation  results  show  that 
our  RSN  vastly  reduces  the  ATR  error  comparing  to  a  single  radar  system  in  ATR  with  delay- 
doppler  uncertainty.  The  proposed  waveform  design  and  diversity  algorithms  can  also  be  applied 
to  active  RFID  sensor  networks  and  underwater  acoustic  sensor  networks. 

Index  Terms  :  radar  sensor  networks,  waveform  diversity,  automatic  target  recognition, 
maximum-likelihood,  interferences,  delay-doppler  uncertainty. 
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1  Introduction  and  Motivation 


The  goal  for  any  target  recognition  system  is  to  give  the  most  accurate  interpretation  of  what  a 
target  is  at  any  given  point  in  time.  There  are  two  classes  of  motion  models  of  target,  one  for 
maneuvering  targets  and  one  for  non-maneuvering  (constant  velocity  and  acceleration)  targets. 
The  area  that  is  still  lacking  in  target  recognition  is  the  ability  to  detect  reliably  when  a  target  is 
beginning  a  maneuver  where  its  speed  and  range  are  uncertain.  The  tracking  system  can  switch  the 
algorithms  applied  to  the  problem  from  a  non-maneuvering  set  to  the  maneuvering  set  when  a  target 
is  beginning  a  maneuver.  But  when  the  tracker  does  finally  catch  up  to  the  target  after  the  maneuver 
and  then  perform  ATR,  the  latency  is  too  high.  In  time  critical  mission  situation,  such  latency  in 
ATR  is  not  tolerable.  In  this  paper,  we  are  interested  in  studying  automatic  target  recongition  with 
range  and  speed  uncertainty,  i.e.,  delay-doppler  uncertainty,  using  radar  sensor  networks  (RSN). 
The  network  of  radar  sensors  should  operate  with  multiple  goals  managed  by  an  intelligent  platform 
network  that  can  manage  the  dynamics  of  each  radar  to  meet  the  common  goals  of  the  platform, 
rather  than  each  radar  to  operate  as  an  independent  system.  Therefore,  it  is  significant  to  perform 
signal  design  and  processing  and  networking  cooperatively  within  and  between  platforms  of  radar 
sensors  and  their  communication  modules.  In  this  paper,  we  are  interested  in  studying  algorithms 
on  radar  sensor  network  (RSN)  design  based  linear  frequency  modulation  (LFM)  waveform:  (1) 
the  conditions  for  waveform  co-existence,  (2)  interferences  among  waveforms  in  RSN,  (3)  waveform 
diversity  in  RSN.  Then  we  apply  RSN  to  automatic  target  recognition  (ATR)  with  delay-doppler 
uncertainty. 

In  nature,  diverse  waveforms  are  transmitted  by  animals  for  specific  applications.  For  example, 
when  a  bat  and  a  whale  are  in  the  search  mode  for  food,  they  emit  a  different  type  of  waveform  than 
when  they  are  trying  to  locate  their  prey.  The  Doppler-invariant  waveforms  that  they  transmit 
are  environment  dependent  [8].  Hence,  in  RSN,  it  may  be  useful  to  transmit  different  waveform 
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from  different  neighbor  radars  and  they  can  collaboratively  perform  waveforms  diversity  for  ATR. 
Sowelam  and  Tewfik  [21]  developed  a  signal  selection  strategy  for  radar  target  classification,  and 
a  sequential  classification  procedure  was  proposed  to  minimize  the  average  number  of  necessary 
signal  transmissions.  Intelligent  waveform  selection  was  studied  in  [1]  [9] ,  but  the  effect  of  doppler 
shift  was  not  considered.  In  [12],  the  performance  of  constant  frequency  (CF)  and  linear  frequency 
modulated  (LFM)  waveform  fusion  from  the  standpoint  of  the  whole  system  was  studied,  but  the 
effects  of  clutter  was  not  considered.  In  [20],  CF  and  LFM  waveforms  were  studied  for  sonar  system, 
but  it  was  assumed  that  the  sensor  is  nonintelligent  (i.e.,  waveform  can’t  be  selected  adaptively).  All 
the  above  studies  and  design  methods  were  focused  on  the  waveform  design  or  selection  for  a  single 
active  radar  or  sensor.  In  [18],  cross-correlation  properties  of  two  radars  are  briefly  mentioned 
and  the  binary  coded  pulses  using  simulated  annealing  [4]  are  highlighted.  However,  the  cross¬ 
correlation  of  two  binary  sequences  such  as  binary  coded  pulses  (e.g.  Barker  sequence)  are  much 
easier  to  study  than  that  of  two  analog  radar  waveforms.  In  [10],  CF  waveform  design  was  applied 
to  RSN  with  application  to  ATR  without  any  delay-doppler  uncertainty.  In  this  paper,  we  will 
focus  on  the  waveform  design  fusion  for  radar  sensor  networks  using  LFM  waveform. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  2,  we  study  the  co-existence  of  LFM 
radar  waveforms.  In  Section  3,  we  analyze  the  interferences  among  LFM  radar  waveforms.  In 
Section  4  we  propose  a  RAKE  structure  for  waveform  diversity  combining  and  propose  maximum- 
likelihood  (ML)  algorithms  for  automatic  target  recognition  (ATR)  with  delay-doppler  uncertainty. 
In  Section  5,  we  provide  simulation  results  on  ML- ATR  with  delay-doppler  uncertainty.  In  Section 
6,  we  conclude  this  paper  and  provide  some  future  works. 
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2  Co-existence  of  LFM  Radar  Waveforms 


In  radar  sensor  networks  (RSN),  radar  sensors  will  interfere  with  each  other  and  the  signal-to- 
interference-ratio  may  be  very  low  if  the  waveforms  are  not  properly  designed.  We  will  introduce 
orthogonality  as  one  criterion  for  waveforms  design  in  RSN  to  make  them  co-existence.  Besides, 
the  radar  channel  is  narrow-band,  so  we  will  also  consider  the  bandwidth  constraint. 

In  our  radar  sensor  networks,  we  choose  linear  frequency  modulation  (LFM)  waveform.  The 
LFM  waveform  can  be  defined  as 

x(t)  =  exp(j2?r fit2)  -  Tj 2  <  t  <  T/2  (1) 

In  radar,  ambiguity  function  (AF)  is  an  analytical  tool  for  waveform  design  and  analysis  that 
succinctly  characterizes  the  behavior  of  a  waveform  paired  with  its  matched  filter.  The  ambiguity 
function  is  useful  for  examing  resolution,  side  lobe  behavior,  and  ambiguities  in  both  range  and 
Doppler  for  a  given  waveform[16].  For  a  single  radar,  the  matched  filter  for  waveform  x(t )  is  x*(-t), 
and  the  ambiguity  function  of  LFM  waveform  is[16] 

rT/2 

A(t,  Fd)  =  /  x(t)  exp  (j2nF£>t)x*(t  -  r)dt 

J-T/2+t 

Es'm[n(FD  +  (3t)(T  -  |r|)] 

Tit(Fd  +  fir) 

Three  special  cases  can  simplify  this  AF : 


(3) 

(4) 

(5) 
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However,  the  above  ambiguity  is  for  one  radar  only  (no  co-existing  radar). 


For  radar  sensor  networks,  the  waveforms  from  different  radars  will  interfere  with  each  other. 
We  choose  the  waveform  for  radar  i  as 

Xi(t)  =  \j ^ exp\j2ir(/3t2  +  6^)}  -  T/2  <  t  <  T/2  (6) 

which  means  there  is  a  frequency  shift  Si  for  radar  i.  To  minimize  the  interference  from  one  waveform 
to  the  other,  optimal  values  for  Si  should  be  determined  to  have  the  waveforms  orthogonal  to  each 
other,  i.e. ,  let  the  cross-correlation  between  X{(t)  and  xn(t)  be  0, 

/•T/2 

/  Xi(t)x*n{t)dt 
J  —T/2 

E  [T/ 2 

=  /  exp[;'27r(/3t2  +  <M)]  exp[-j27r(/ft2  +  5nt)\dt 

1  J-T/2 

=  JFsinc[7r((5i  -  5„)T]  (7) 


If  we  choose 


(8) 


where  i  is  a  dummy  index,  then  (7)  can  have  two  cases 


E  i  =  n 

< 

0  i  7^  n 


(9) 


So  choosing  Si  =  ^  in  (6)  can  have  orthogonal  waveforms,  i.e.,  the  waveforms  can  co-exist  if  the 
carrier  spacing  is  1  /T  between  two  radar  waveforms,  i.e.,  orthogonality  amongst  carriers  can  be 
achieved  by  separating  the  carriers  by  an  interger  multiple  of  the  inverse  of  waveform  pulse  duration. 
With  this  design,  all  the  orthogonal  waveforms  can  work  simultaneously.  However,  there  may  exist 
time  delay  and  doppler  shift  ambiguity  which  will  have  interferences  to  other  waveforms  in  RSN. 
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3  Interferences  of  LFM  Waveforms  In  Radar  Sensor  Networks 


3.1  RSN  with  Two  Radar  Sensors 

We  are  interested  in  analyzing  the  interference  from  one  radar  to  another  if  there  exist  time  delay 
and  doppler  shift.  For  a  simple  case  where  there  are  two  radar  sensors  ( i  and  n),  the  ambiguity 
function  of  radar  i  (considering  interference  from  radar  n )  is 


d?  i  tn,  PDi,FDn ) 

/oo 

[xi(t)  exp(j2nFD.t)  +  xn(t  -  tn)exp(j2irFDnt)]x*(t  -  ti)dt\ 

-00 


< 


+ 


-00 

/•T/2+mm(ti,tn) 


+ 


/  xn(t  -  tn )  exp( j27r Fdu t)x* (t  -  ti)dt 

d-T/2+max(t,-,tn) 

fT/2 

/  Xi(t)  exp  (j2-KFDit)x* ( t  -  ti)dt 

J-T/2+U 

/•T/2-fmin(ti,tn) 

/  xn(t-  tn)  exp(j'27r FDnt)x* (t  -  U)dt 

J -T /2+max(tj  ,tn) 

Esin[K{FDi  +  0ti){T  -  \U\)] 


Tn(FDi  +  (3ti) 

To  make  analysis  easier,  we  assume  U  =  tn  =  r,  then  (13)  can  be  simplified  as 

Ai{T,FDi,FDn)  «  |£sinc[7r(n  -  i  +  FDnT)}\ 

,  \Esm[K(FDi+PT)(T-\T\)} 


Tn(FD.  +  (3t) 


Some  special  cases  of  (14)  are  listed  as  follows: 

1.  If  Fp.  =  Fdu  =  0,  then  (14)  becomes 

Mr,  0,0)  « 

2.  If  r  =  0,  then  (14)  becomes 


E  sin[7r  (3t(T  —  |r|)] 


7t/3Tt 


M0,FDi,FDn)  «  \Esmc[n(n-i  + FDnT)]\ 
+  |Fisinc(7rFDiT)| 


(10) 

(11) 


(12) 


(13) 


(14) 


(15) 


(16) 
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3.  If  Fd{  =  Fon  =  0,  t  —  0,  and  <5,;  and  5n  follow  (8),  then  (14)  becomes 


A  (0,0,0) 


(17) 


3.2  RSN  with  M  Radar  Sensors 

It  can  be  extended  to  an  RSN  with  M  radars.  Assuming  time  delay  r  for  each  radar  is  the  same, 
then  the  ambiguity  function  of  radar  1  (considering  interferences  from  all  the  other  M  —  1  radars 
with  CF  pulse  waveforms)  can  be  expressed  as 


M(j,Fd i,-"  ,FDm)  « 


+ 


M 


Y  Flsinc[-7r(i  -  1  +  FDiT)\ 


i=2 


.Esin^F^  +  /3t)(T  -  |t|)] 


Ttt(FDi  +  0t) 


(18) 


Similarly,  we  can  have  three  special  cases, 


1.  If  Fdx  —  Fp2  =  •  •  •  =  Fqm  =  0,  then  (18)  becomes 

\Esin{npT(T  —  |r|)] 


Ai(t,0, (),•••  ,0) 


tc(3Tt 


(19) 


comparing  it  against  (3),  it  shows  that  our  derived  condition  in  (6)  can  have  a  radar  in  RSN 
get  the  same  signal  strength  as  that  of  a  single  radar  (no  co-existing  radar)  when  the  doppler 
shift  is  0. 


2.  If  r  =  0,  then  (18)  becomes 


M  (0,  FDl ,  FD2 ,  •  •  •  ,  FDm  ) 

M 

'Y  -Fsinc[7r(z  —  1  +  Fp^T  +  @tT )] 


(20) 


i~  1 


Comparing  to  (4),  a  radar  in  RSN  has  more  interferences  when  unknown  doppler  shifts  exist. 


3.  Fd j  =  Fd2  —  ■  ■  ■  =  Fdm  =  0,  t  =  0,  and  Si  in  (6)  follows  (8),  then  (18)  becomes 


A(0,0,0,...  ,0 )«£? 


(21) 
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4  Application  to  Automatic  Target  Recognition  (ATR)  with  Delay- 
Doppler  Uncertainty 

In  RSN,  The  radar  sensors  are  networked  together  in  an  ad  hoc  fashion.  They  do  not  rely  on  a 
preexisting  fixed  infrastructure,  such  as  a  wireline  backbone  network  or  a  base  station.  They  are  self¬ 
organizing  entities  that  are  deployed  on  demand  in  support  of  various  events  surveillance,  battlefield, 
disaster  relief,  search  and  rescue,  etc.  Scalability  concern  suggest  a  hierarchical  organization  of  radar 
sensor  networks  with  the  lowest  level  in  the  hierarchy  being  a  cluster.  As  argued  in  [11]  [7]  [6]  [14], 
in  addition  to  helping  with  scalability  and  robustness,  aggregating  sensor  nodes  into  clusters  has 
additional  benefits: 

1.  conserving  radio  resources  such  as  bandwidth; 

2.  promoting  spatial  code  reuse  and  frequency  reuse; 

3.  simplifying  the  topology,  e.g.,  when  a  mobile  radar  changes  its  location,  it  is  sufficient  for 
only  the  nodes  in  attended  clusters  to  update  their  topology  information; 

4.  reducing  the  generation  and  propagation  of  routing  information;  and, 

5.  concealing  the  details  of  global  network  topology  from  individual  nodes. 

In  RSN,  each  radar  can  provide  their  waveform  parameters  such  as  Si  to  their  clusterhead  radar, 
and  the  clusterhead  radar  can  combine  the  waveforms  from  its  cluster  members. 

In  RSN  with  M  radars,  the  received  signal  for  clusterhead  (assume  it’s  radar  1)  is 

M 

ri{u,t)  =  y;  a(u)xj(t  -  tj)  exp(j27rFDit)  +  n(u,  t )  (22) 

i= l 

where  a(u)  stands  for  radar  cross  section  (RCS)  and  can  be  modeled  using  non-zero  constants  for 
nonfluctuating  target  and  four  Swerling  target  models  for  fluctuating  target[16];  is  the  doppler 
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shift  of  target  relative  to  waveform  i;  t*  is  delay  of  waveform  i,  and  n(u,t )  is  additive  white  Gaussian 
noise  (AWGN).  In  this  paper,  we  propose  a  RAKE  structure  for  waveform  diversity  combining,  as 
illustrated  by  Fig.  1. 

According  to  this  structure,  the  received  ri(u,t)  is  processed  by  a  bank  of  matched  filters,  then 
the  output  of  branch  1  (after  integration)  is 


>  Fd-j  > '  '  ’ 

T/2 
T/2 

T/2  M 


/  ri(u,t)a;i(t  -  ti)ds 
J —T/2 

{•T/2  M 

=  \  lY' ai(u)xi(t  -  ti)exp(j2-KFDit)  +  n(u,t)]xl(t  -  ti)dt\ 

J —T/2  i=1 


where  yy2n(u,t)x\(t  -  t\)dt  can  easily  be  proved  to  be  AWGN,  let 

r t/2 


a  r; 

n{u,t\)  =  /  n{u,t)x\(t  -  t\)dt 
J-T/2 


follows  Rayleigh  distribution.  Assuming  t\  =  £2  =  •  •  •  =  tM  =  t,  then  based  on  (18), 


(23) 

(24) 


(25) 


\Zi  («;  r,  Fpj ,  •  •  •  ,Fdm)  I 


M 


j  ^  a(u)Fsinc[7r(i  —  1  +  F/^T)] 


i=2 


,  a(u)Fsin[T(Fjg1  +/?r)(T- jrj)]  ,  ,, 

T/k(Fjd1  +  0t)  K  ’  ;| 


(26) 


Similarly,  we  can  get  the  output  for  any  branch  m  [m  —  1, 2,  ■  •  •  ,  M), 


+ 


I  Zm  (u,  t,  Fj)1 ,  •  •  •  ,  Fdm  )  | 

M 

|  ^  a(u)Fsinc[7r(f  —  m  +  Fd{T )] 

1=1,2^771 

a(u)Fsin[7r(Fz)m  +  /?t)(T-  |r|)] 


T*  (FDm+pT) 


+  n(u,r)  | 


(27) 


So  |Zm(u;  r,  F/jj ,  •  •  •  ,  Fdm)|  consists  of  three  parts,  signal  (reflected  signal  from  radar  m  waveform): 


a{u)E  sinfrCgbg,  +0t)(T-\t\)} 
rt'ir(FDm+/3T) 


interferences  from  other  waveforms:  YhiL\  iJ=m  |o:(w)Fsinc[7r(j  —  m  +  Fd.T)]|, 
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and  noise:  \n(u,  r)|.  Delay-doppler  uncertainty  happens  quite  often  in  target  search  and  recognition 
where  target  range  and  velocity  are  not  yet  perfectly  known. 


We  can  also  have  three  special  cases  for  \Ztii(u;t,Fd1,  •  ■  •  , Fdm ) \ ■ 

1.  When  FDl  =  ■  •  •  =  FDm  -  0, 

\Zm{u]T,  0, 0,  •  •  •  ,0)| 

«  |a(")Esin[T<3T(r-lTl)]+n(„,T)|  (28) 

Ti\  (3t 

2.  If  t  =  0,  then  (27)  becomes 

\Zm{u-,0,FDl,-  ■  ■  ,FDm) | 

M 

w  |  ^2  a(tt)^sinc[7r(f  -  m  +  F^T)}  +  n(u)  |  (29) 

i=l 

3.  If  r  =  0,  and  Fdx  =  ■  ■  ■  —  FpM  =  0,  then  (27)  becomes 

| Zm(u]  0, 0, 0,  •  •  •  ,0)|  «  | Ea(u)  +  n(u)|  (30) 


How  to  combine  all  the  Zm’ s  (m  =  1,2,  •••  ,  M)  are  very  similar  to  the  diversity  combining 
in  communations  to  combat  channel  fading,  and  the  combination  schemes  may  be  different  for 
different  applications.  In  this  paper,  we  are  interested  in  applying  RSN  waveform  diversity  to 
automatic  target  recognition  (ATR),  e.g.,  recognition  that  the  echo  on  a  radar  display  is  that  of  an 
aircraft,  ship,  motor  vehicle,  bird,  person,  rain,  chaff,  clear-air  turbulence,  land  clutter,  sea  clutter, 
bare  montains,  forested  areas,  meteors,  aurora,  ionized  media,  or  other  natural  phenomena.  Early 
radars  were  “blob”  detectors  in  that  they  detected  the  presence  of  a  target  and  gave  its  location  in 
range  and  angle,  and  radar  began  to  be  more  than  a  blob  detector  and  could  provide  recognition 
of  one  type  of  target  from  another[18].  It  is  known  that  small  changes  in  the  aspect  angle  of 
complex  (multiple  scatter)  targets  can  cause  major  changes  in  the  radar  cross  section  (RCS).  This 
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has  been  considered  in  the  past  as  a  means  of  target  recognition,  and  is  called  fluctuation  of  radar 
cross  section  with  aspect  angle,  but  it  has  not  had  much  success[18].  In  this  paper,  we  propose 
a  maximum  likelihood  automatic  target  recognition  (ML-ATR)  algorithm  for  RSN.  We  will  study 
both  fluctuating  target  and  non-fluctuating  target. 


4.1  ML-ATR  for  Fluctuating  Targets  with  Delay-Doppler  Uncertainty 

Fluctuating  target  modeling  is  more  realistic  in  which  the  target  RCS  is  drawn  from  either  the 
Rayleigh  or  chi-square  of  degree  four  pdf.  The  Rayleigh  model  describes  the  behavior  of  a  complex 
target  consisting  of  many  scatters,  none  of  which  is  dominant.  The  fourth-degree  chi-square  models 
targets  having  many  scatters  of  similar  strength  with  one  dominant  scatter.  Based  on  different 
combinations  of  pdf  and  decorrelation  characteristics  (scan-to-scan  or  pulse-to-pulse  decorrelation), 
four  Swerling  models  are  used[16].  In  this  paper,  we  will  focus  on  “Swerling  2”  model  which  is 
Rayleigh  distribution  with  pulse-to-pulse  decorrelation.  The  pulse-to-pulse  decorrelation  implies 
that  each  individual  pulse  results  in  an  independent  value  for  RCS  a. 

For  Swerling  2  model,  the  RCS  |a(«)|  follows  Rayleigh  distribution  and  its  I  and  Q  subchannels 
follow  zero- mean  Gaussian  distributions  with  variance  q2.  Assume 


a{u)  =  a  flu)  +jaQ(u) 


(31) 


and  n(u)  =  nflu)  +  jnQ(u)  follows  zero-mean  complex  Gausian  distribution  with  variance  c2  for 
the  I  and  Q  subchannels.  Observe  (27),  for  given  r,  F^  (i  =  1,-  •  •  ,  M), 

t  «M»M*  -  ■»  +  fB.t)\  h-  a(")£5it^I++tf  ~  W>1 


=  a{u)E[  sinc[7r(f  -  m  +  FDiT))  +  Sm^'  —  ]  (32) 

Tn(FDTn+flT) 

follows  zero-mean  complex  Gaussian  distributions  with  variance  .E2q2  Efli  j^msmc[7r(i  —  m  + 
t-w(fd  +Pt) - j  f°r  the  I  and  Q  subchannels.  Since  n(u,r )  also  follows  zero-mean 


pD  t)]  +  giuMFgm  +0t)(t~\t\)\  12 
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Gaussian  distribution,  so  \Zm(u;  t,  ,  •  •  •  ,  Fdm)\  °f  (27)  follows  Rayleigh  distribution.  In  real 
world,  the  perfect  values  of  r  and  Fd{  are  not  known  in  the  target  search  phase  and  the  mean  values 
of  t  and  Fp.  are  0,  so  we  just  assume  the  parameter  of  this  Rayleigh  distribution  b  =  \J E2ry2  +  u2 
(when  r  and  Fr>i  equal  to  0). 

Let  ym  =  \Zm(u\  t,  Fd1  ,  •  •  ■  ,  Fdm  )  | ,  then 


f(ym)  -  £272  +  a2  eXP(  £pjT+  a2  ) 


(33) 


The  mean  value  of  ym  is  \J~^~ 

(4-7T )E  7  an(^  varjance  Qf  noise  is  (4-ir)0'  . 


(E2y2+a2) ,  and  variance  js  (4  7r)(g272+a2)  ^  -p^e  varjance  0f  signal  is 


Let  y  =  [yi,y2,  ■  •  ■  ,2/m],  then  the  pdf  of  y  is 


M 


/( y)  =  n  /(»"») 


(34) 


m=l 


Our  ATR  is  a  multiple-category  hypothesis  testing  problem,  i.e.,  to  decide  a  target  category 
(e.g.  different  aircraft,  motor  vehicle,  etc)  based  on  r\(u,t).  Assume  there  are  totally  N  categories 
and  category  n  target  has  RCS  an(u)  (with  variance  7^),  so  the  ML-ATR  algorithm  to  decide  a 
target  category  C  can  be  expressed  as, 


C  =  arg  max  /(y|7  =  7„) 

n= 1,—  ,N 


(35) 


=  arg  max 


M 

n 


Vm 


exp(— 


E^ii  +  cr2  E^l  +  u2 


f) 


4.2  ML-ATR  for  Non-fluctuating  Targets  with  Delay-Doppler  Uncertainty 


In  some  sources,  the  non-fluctuating  target  is  identified  as  “Swerling  0”  or  “Swerling  5”  model[19]. 
For  non-fluctuating  target,  the  RCS  a(u )  is  just  a  constant  a  for  a  given  target.  Observe  (27),  for 
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given  r,  FDi  (i  =  1,  •  ■  •  ,M), 


V'  T~*  m\ i  oc(u)E sinh (F£,m  +  (3r)(T  -  \t\)} 

£  aMEsmcHi-m  +  Far)]  +  -U ^ 

■*1  £  +  + 

2  ”•  1  Jb-f—TXh 


is  just  a  constant.  Since  n(u,  r)  follows  zero-mean  Gaussian  distribution,  so  r,  *bi,' 

of  (27)  follows  Rician  distribution  with  direct  path  value 

A  =  aE[  £  ■»■■*(«  -  m  +  rc,T)]  +  WP ■  -  M)] | 

Tn(FDm  +  fir) 

Since  r  and  are  uncertain  and  zero-mean,  so  we  just  use  the  approximation 


A  =  aE 


which  is  obtained  when  r  and  Fpi  equal  to  0. 

Let  ym  =  \Zm(u;  r,  Fp1 ,  •  •  •  ,  Fdm) |,  then  the  probability  density  function  (pdf)  of  ym  is 

a  \  2 ym  r  {Vm  d"  A^ ) I t  / 2Aym .  /.n\ 

f{ym)  =  -^r  exP[ — ^2 — l7o(— ^2— )  (40) 

where  a2  is  the  noise  power  (with  I  and  Q  sub-channel  power  a2/ 2),  and  Iq(-)  is  the  zero-order 
modified  Bessel  function  of  the  first  kind.  Let  y  =  [2/1,3/21  •  •  •  ,3/m],  then  the  pdf  of  y  is 


/( y)  =  n  /fo™) 


The  ML-ATR  algorithm  to  decide  a  target  category  C  based  on  y  can  be  expressed  as, 


arg  max  /(y|A  =  £|an|) 

71=1, •••  ,N 


arg  max  TT  — ^  exp[— 

n=l,-,/V  11  CT2  1 


2 3/m _ r  (3/m  T  ■®2an) i  r  (-2£'|an|3/„ 
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5  Simulations 


Radar  sensor  networks  will  be  required  to  detect  a  broad  range  of  target  classes.  In  this  paper,  we 
applied  our  ML-ATR  to  automatic  target  recognition  with  delay-doppler  uncertainty.  We  assume 
that  the  domain  of  target  classes  is  known  a  priori  (N  in  Sections  4.1  and  4.2),  and  that  the  RSN 
is  confined  to  work  only  on  the  known  domain. 

For  fluctuating  target  recognition,  our  targets  have  6  classes  with  different  RCS  values,  which  are 
summarized  in  Table  1  [18] .  We  assume  the  fluctuating  targets  follow  “Swerling  2”  model  (Rayleigh 
with  pulse-to-pulse  decorrelation),  and  assume  the  RCS  value  listed  in  Table  1  to  be  the  standard 
deviation  (std)  of  RCS  an(u)  for  target  n.  We  applied  the  ML-ATR  algorithm  in  Section  4.1  (for 
fluctuating  target  case)  for  target  recognition  within  the  six  targets  domain.  We  chose  T  —  0.1  ms 
and  (3  =  106.  At  each  average  SNR  value,  we  ran  Monte-Carlo  simulations  for  105  times  for  each 
target.  In  Fig.  2(a) (b)(c),  we  plot  the  average  ATR  error  for  fluctuating  targets  with  different 
delay-doppler  uncertainty  and  compared  the  performances  of  single  radar  system,  5-radar  RSN, 
and  10-radar  RSN.  Observe  these  three  figures: 

1.  The  two  RSNs  vastly  reduce  the  ATR  error  comparing  to  a  single  radar  system  in  ATR 
with  delay-doppler  uncertainty,  e.g.,  the  10-radar  RSN  can  achieve  ATR  error  2%  comparing 
against  the  single  radar  system  with  ATR  error  37%  at  SNR  —  32 dB  with  delay-doppler 
uncertainty  r  £  [— 0.1T,  0.1T]  and  Fq.  £  [-200Hz,  20077 z). 

2.  Our  LFM  waveform  design  can  tolerate  reasonable  delay-doppler  uncertainty  which  are  tes¬ 
tified  by  Fig.  2(b) (c). 

3.  According  to  Skolnik[18],  radar  performance  with  probability  of  recognition  error  (pe)  less 
than  10%  is  good  enough.  Our  10-radar  RSN  with  waveform-diversity  can  have  probability 
of  ATR  error  much  less  than  10%  for  the  average  ATR  for  all  targets.  However,  the  single 
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radar  system  has  probability  of  ATR  error  much  higher  than  10%.  Our  RSN  with  waveform 
diversity  is  very  promising  to  be  used  for  real-world  ATR. 

4.  Observe  Fig.  2(a) (c),  the  average  probability  of  ATR  error  in  2(c)  is  not  as  sensitive  to 
the  SNR  as  that  in  2(a),  i.e.,  ATR  error  curve  slope  becomes  flat  with  higher  delay-doppler 
uncertainty,  which  means  the  delay-doppler  uncertainty  can  dominate  the  ATR  performance 
when  it’s  too  high. 

For  non-fluctuating  target  recognition,  our  targets  have  6  classes  with  different  RCS  values, 
which  are  summarized  in  Table  1[18].  We  applied  the  ML- ATR  algorithms  in  Section  4.2  (for 
nonfluctuating  target  case)  to  classify  an  unknown  target  as  one  of  these  6  target  classes.  We  chose 
T  —  0.1ms  and  ft  =  106.  At  each  average  SNR  value,  we  ran  Monte-Carlo  simulations  for  105 
times  for  each  target.  In  Fig.  3(a)(b)(c),  we  plotted  the  probability  of  ATR  error  with  different 
delay-doppler  uncertainty.  Observe  these  figures, 

1.  The  two  RSNs  tremendously  reduce  the  ATR  error  comparing  to  a  single  radar  system  in  ATR 
with  delay-doppler  uncertainty,  e.g.,  the  10-radar  RSN  can  achieve  ATR  error  9%  comparing 
against  the  single  radar  system  with  ATR  error  22%  at  SNR  =  22 cLB  with  delay-doppler 
uncertainty  t  G  [-0.2T,0.2T]  and  FDi  €  [-500tfz,  500Hz}. 

2.  Comparing  Fig.  2(a)(b)(c)  against  Fig.  3(a)(b)(c),  the  gain  of  10-radar  RSN  for  fluctuating 
target  recognition  is  much  larger  than  that  for  non-fluctuating  target  recognition,  which  means 
our  RSN  has  better  capacity  to  handle  the  fluctuating  targets.  In  real  world,  fluctuating 
targets  are  more  meaningful  and  realistic. 

3.  Comparing  Fig.  3(a)  (b)(c)  against  Fig.  2(a)  (b)(c),  the  ATR  needs  much  lower  SNR  for 
nonfluctuating  target  recognition  because  Rician  distribution  has  direct  path  component. 
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6  Conclusions  and  Future  Works 

We  have  studied  LFM  waveform  design  and  diversity  in  radar  sensor  networks  (RSN).  We  showed 
that  the  LFM  waveforms  can  co-exist  if  the  carrier  frequency  spacing  is  1/T  between  two  radar 
waveforms.  We  made  analysis  on  interferences  among  waveforms  in  RSN  and  proposed  a  RAKE 
structure  for  waveform  diversity  combining  in  RSN.  We  applied  the  RSN  to  automatic  target 
recognition  (ATR)  with  delay-doppler  uncertainty  and  proposed  maximum-likehood  (ML)-ATR 
algorithms  for  fluctuating  target  and  non-fluctuating  target.  Simulation  results  show  that  RSN 
using  our  waveform  diversity-based  ML-ATR  algorithm  performs  much  better  than  single  radar 
system  for  fluctuating  targets  and  nonfluctuating  targets  recognition.  It  is  also  demonstrated  that 
our  LFM  waveform-based  RSN  can  handle  the  delay-doppler  uncertainty  which  quite  often  happens 
for  ATR  in  target  search  phase. 

The  waveform  design  and  diversity  algorithms  proposed  in  this  paper  can  also  be  applied  to 
active  RFID  sensor  networks  and  underwater  acoustic  sensor  networks  because  LFM  waveforms 
can  also  be  used  by  these  active  sensor  networks  to  perform  collaborative  monitoring  tasks.  In 
this  paper,  the  ATR  is  for  single  target  recognition.  We  will  continuously  investigate  the  ATR 
when  multiple  targets  co-existence  in  RSN  and  each  target  has  delay-doppler  uncertainty.  In  our 
waveform  diversity  combining,  we  have  used  spatial  diversity  combining  in  this  paper.  We  will 
further  investigate  spatial-temporal-frequency  combining  for  RSN  waveform  diversity. 
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Table  1:  RCS  values  at  microwave  frequency  for  6  targets. 


Index  n 

Target 

RCS 

1 

Small  single-engine  aircraft 

1 

2 

Large  flighter  aircraft 

6 

3 

Medium  bomber  or  jet  airliner 

20 

4 

Large  bomber  or  jet  airliner 

40 

5 

Jumbo  jet 

6 

Pickup  truck 

20 
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Figure  1:  Waveform  diversity  combining  by  clusterhead  in  RSN. 
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Figure  2:  The  average  probability  of  ATR  error  for  6  fluctuating  targets  with  different  delay-doppler 
uncertainty,  (a)  no  delay-doppler  uncertainty,  (b)  with  delay-doppler  uncertainty,  r  £  [— 0.1T,  0.1T] 
and  FjDi  £  [-200Hz,200Hz],  and  (c)  with  delay-doppler  uncertainty,  r  £  [— 0.2T,  0.2T]  and  Fqi  £ 
{-b00Hz,500Hz\. 
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Figure  3:  The  average  probability  of  ATR  error  for  6  nonfluctuating  targets  with  different 
delay-doppler  uncertainty,  (a)  no  delay-doppler  uncertainty,  (b)  with  delay-doppler  uncertainty, 
r  €  [— 0.1T,  0.1T]  and  Fd{  €  [-200Hz,200Hz\,  and  (c)  with  delay-doppler  uncertainty,  r  € 
[— 0.2T,  0.2T]  and  FDi  €  [-500Hz,500Hz], 
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Abstract — Automatic  target  recognition  (ATR)  in  target 
search  phase  is  very  challenging  because  the  target  range 
and  mobility  are  not  yet  perfectly  known,  which  results  in 
delay-doppler  uncertainty.  In  this  paper,  we  firstly  perform 
some  theoretical  studies  on  radar  sensor  network  (RSN) 
design  based  linear  frequency  modulation  (LFM)  waveform: 
(1)  the  conditions  for  waveform  co-existence,  (2)  interferences 
among  waveforms  in  RSN,  (3)  waveform  diversity  in  RSN. 
Then  we  apply  RSN  to  ATR  with  delay-doppler  uncertainty 
and  propose  maximum-likekihood  (ML)  ATR  algorithms 
for  fluctuating  target  and  nonfluctuating  target.  Simulation 
results  show  that  our  RSN  vastly  reduces  the  ATR  error 
comparing  to  a  single  radar  system  in  ATR  with  delay- 
doppler  uncertainty. 

I.  Introduction  and  Motivation 

The  goal  for  any  target  recognition  system  is  to  give 
the  most  accurate  interpretation  of  what  a  target  is  at  any 
given  point  in  time.  There  are  two  classes  of  motion  models 
of  target,  one  for  maneuvering  targets  and  one  for  non¬ 
maneuvering  (constant  velocity  and  acceleration)  targets. 
The  area  that  is  still  lacking  in  target  recognition  is  the  abil¬ 
ity  to  detect  reliably  when  a  target  is  beginning  a  maneuver 
where  its  speed  and  range  are  uncertain.  The  tracking 
system  can  switch  the  algorithms  applied  to  the  problem 
from  a  non-maneuvering  set  to  the  maneuvering  set  when 
a  target  is  beginning  a  maneuver.  But  when  the  tracker 
does  finally  catch  up  to  the  target  after  the  maneuver  and 
then  perform  ATR,  the  latency  is  too  high.  In  time  critical 
mission  situation,  such  latency  in  ATR  is  not  tolerable.  In 
this  paper,  we  are  interested  in  studying  automatic  target 
recongition  with  range  and  speed  uncertainty,  i.e.,  delay- 
doppler  uncertainty,  using  radar  sensor  networks  (RSN). 
The  network  of  radar  sensors  should  operate  with  multiple 
goals  managed  by  an  intelligent  platform  network  that  can 
manage  the  dynamics  of  each  radar  to  meet  the  common 
goals  of  the  platform,  rather  than  each  radar  to  operate 
as  an  independent  system.  Therefore,  it  is  significant  to 
perform  signal  design  and  processing  and  networking  co¬ 


operatively  within  and  between  platforms  of  radar  sensors 
and  their  communication  modules.  In  this  paper,  we  are 
interested  in  studying  algorithms  on  radar  sensor  network 
(RSN)  design  based  linear  frequency  modulation  (LFM) 
waveform:  (1)  the  conditions  for  waveform  co-existence, 
(2)  interferences  among  waveforms  in  RSN,  (3)  waveform 
diversity  in  RSN.  Then  we  apply  RSN  to  automatic  target 
recognition  (ATR)  with  delay-doppler  uncertainty. 

In  nature,  diverse  waveforms  are  transmitted  by  animals 
for  specific  applications.  For  example,  when  a  bat  and  a 
whale  are  in  the  search  mode  for  food,  they  emit  a  different 
type  of  waveform  than  when  they  are  trying  to  locate  their 
prey.  The  Doppler-invariant  waveforms  that  they  transmit 
are  environment  dependent  [8].  Hence,  in  RSN,  it  may  be 
useful  to  transmit  different  waveform  from  different  neigh¬ 
bor  radars  and  they  can  collaboratively  perform  waveforms 
diversity  for  ATR.  Sowelam  and  Tewfik  [21]  developed 
a  signal  selection  strategy  for  radar  target  classification, 
and  a  sequential  classification  procedure  was  proposed  to 
minimize  the  average  number  of  necessary  signal  trans¬ 
missions.  Intelligent  waveform  selection  was  studied  in 
[1][9],  but  the  effect  of  doppler  shift  was  not  considered. 
In  [12],  the  performance  of  constant  frequency  (CF)  and 
linear  frequency  modulated  (LFM)  waveform  fusion  from 
the  standpoint  of  the  whole  system  was  studied,  but  the 
effects  of  clutter  was  not  considered.  In  [20],  CF  and 
LFM  waveforms  were  studied  for  sonar  system,  but  it  was 
assumed  that  the  sensor  is  nonintelligent  (i.e.,  waveform 
can’t  be  selected  adaptively).  All  the  above  studies  and 
design  methods  were  focused  on  the  waveform  design  or 
selection  for  a  single  active  radar  or  sensor.  In  [10],  CF 
waveform  design  was  applied  to  RSN  with  application  to 
ATR  without  any  delay-doppler  uncertainty.  In  this  paper, 
we  will  focus  on  the  waveform  design  fusion  for  radar 
sensor  networks  using  LFM  waveform. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section 
II,  we  study  the  co-existence  of  LFM  radar  waveforms.  In 
Section  III,  we  analyze  the  interferences  among  LFM  radar 


waveforms.  In  Section  IV  we  propose  a  RAKE  structure 
for  waveform  diversity  combining  and  propose  maximum- 
likelihood  (ML)  algorithms  for  automatic  target  recogni¬ 
tion  (ATR)  with  delay-doppler  uncertainty.  In  Section  V, 
we  provide  simulation  results  on  ML-ATR  with  delay- 
doppler  uncertainty.  In  Section  VI,  we  conclude  this  paper 
and  provide  some  future  works. 


If  we  choose 

Si  =  £  (5) 

where  i  is  a  dummy  index,  then  (4)  can  have  two  cases 

rT/2  { R  i  —  n 

/  Xi(t)x*n(t  )dt  =  \  (6) 

J-T/2  (0  i^n 


II.  CO-EXISTENCE  OF  LFM  RADAR  WAVEFORMS 
In  radar  sensor  networks  (RSN),  radar  sensors  will 
interfere  with  each  other  and  the  signal-to-interference- 
ratio  may  be  very  low  if  the  waveforms  are  not  properly 
designed.  We  will  introduce  orthogonality  as  one  criterion 
for  waveforms  design  in  RSN  to  make  them  co-existence. 
Besides,  the  radar  channel  is  narrow-band,  so  we  will  also 
consider  the  bandwidth  constraint. 

In  our  radar  sensor  networks,  we  choose  linear  frequency 
modulation  (LFM)  waveform.  The  LFM  waveform  can  be 
defined  as 

x{t)  =  y|exp(j27r/3f2)  -  T/2  <  f  <  T/2  (1) 


So  choosing  <5*  =  ^  in  (3)  can  have  orthogonal  waveforms, 
i.e.,  the  waveforms  can  co-exist  if  the  carrier  spacing 
is  1/T  between  two  radar  waveforms,  i.e.,  orthogonality 
amongst  carriers  can  be  achieved  by  separating  the  carriers 
by  an  interger  multiple  of  the  inverse  of  waveform  pulse 
duration.  With  this  design,  all  the  orthogonal  waveforms 
can  work  simultaneously.  However,  there  may  exist  time 
delay  and  doppler  shift  ambiguity  which  will  have  inter¬ 
ferences  to  other  waveforms  in  RSN. 

III.  Interferences  of  LFM  Waveforms  In  Radar 
Sensor  Networks 

A.  RSN  with  Two  Radar  Sensors 


In  radar,  ambiguity  function  (AF)  is  an  analytical  tool  for 
waveform  design  and  analysis  that  succinctly  characterizes 
the  behavior  of  a  waveform  paired  with  its  matched  filter. 
The  ambiguity  function  is  useful  for  examing  resolution, 
side  lobe  behavior,  and  ambiguities  in  both  range  and 
Doppler  for  a  given  waveform[16].  For  a  single  radar, 
the  matched  filter  for  waveform  x(t)  is  and  the 

ambiguity  function  of  LFM  waveform  is[  1 6] 
rT/ 2 

/  x(t)  exp  (j2trF£>t)x*(t  —  r)dt 
J-T/2+r 

J5sin[7r(F£)  +  0t)(T  —  |r| 


A(t,Fd)  = 


We  are  interested  in  analyzing  the  interference  from  one 
radar  to  another  if  there  exist  time  delay  and  doppler  shift. 

For  a  simple  case  where  there  are  two  radar  sensors  (i 
and  n),  the  ambiguity  function  of  radar  i  (considering 
interference  from  radar  n)  is 

Ai(ti,tn,FDi,  Fpn)  (7) 

/OO 

[xi(t)  exp{j2nFDit)  +  xn{t  -  tn)  exp{j2nFDrit)\ 

•OO 

(8) 


—  T  <  r  <(2) 


Ttt(Fd  +  0  t) 

However,  the  above  ambiguity  is  for  one  radar  only  (no 
co-existing  radar). 

For  radar  sensor  networks,  the  waveforms  from  differ¬ 
ent  radars  will  interfere  with  each  other.  We  choose  the 
waveform  for  radar  i  as 

~E 


xi(t)  =  \j  ^exp[j2n(0t2+6it)}  -T/2  <  t  <  T/2  (3) 


< 


+ 


x* (t  -  ti)dt\ 

/T/2+min(tj,t„) 

xn {t  -  tn)  exp(j2irFDnt)x* ( t  -  ti)dt 


/- 


T/2+max(ii,tTV) 

T/2 

Xi(t)  exp  (j2TrFDit)xl(t  —  ti)dt  (9) 

-r/2+ti 

/T/2+min(ii,t„) 

xn{t  -  tn)exp(j2nFDnt)x*{t -U)dt 

-T/2+ n\a.x(ti,tn) 

Esm[n{FDi  +  0U){T  -  |t 


Tn(FDi  +  0U) 


(10) 


which  means  there  is  a  frequency  shift  <5*  for  radar  i.  To 
minimize  the  interference  from  one  waveform  to  the  other, 
optimal  values  for  <5j  should  be  determined  to  have  the 
waveforms  orthogonal  to  each  other,  i.e.,  let  the  cross¬ 
correlation  between  Xi(t)  and  xn(t)  be  0, 

fT/2 

/  Xi{t)x*n{t)dt 
J-T/2 
rT/2 

=  —  /  exp{j2Tr(0t2  +  5it)}exp[-j2n(0t2  +  5nt)\dt 

1  J-T/2 

=  £sinc[7r(^  -  5n)T\  (4) 


To  make  analysis  easier,  we  assume  £*  =  tn  =  t,  then  (10) 
can  be  simplified  as 


Ai(T,FDi,FDn) 


+ 


Esmc[-n(n  —  i  +  FduT)\  \ 
J5sin[7r(FDi  +  0t)(T  -  |r|)] 


Tn{FDi  +  0t) 


1) 


B.  RSN  with  M  Radar  Sensors 

It  can  be  extended  to  an  RSN  with  M  radars.  Assuming 
time  delay  r  for  each  radar  is  the  same,  then  the  ambiguity 
function  of  radar  1  (considering  interferences  from  all  the 
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other  M  —  1  radars  with  CF  pulse  waveforms)  can  be 
expressed  as 

M 

Ai  (r,  FDl ,  •  •  •  ,  FDm  )  «  ^  £sinc[7r(i  -  1  +  F^T)] 

i=2 

+ 

IV.  Application  to  Automatic  Target 
Recognition  (ATR)  with  Delay-Doppler 
Uncertainty 

In  RSN,  The  radar  sensors  are  networked  together  in  an 
ad  hoc  fashion.  They  do  not  rely  on  a  preexisting  fixed 
infrastructure,  such  as  a  wireline  backbone  network  or 
a  base  station.  They  are  self-organizing  entities  that  are 
deployed  on  demand  in  support  of  various  events  surveil¬ 
lance,  battlefield,  disaster  relief,  search  and  rescue,  etc. 
Scalability  concern  suggest  a  hierarchical  organization  of 
radar  sensor  networks  with  the  lowest  level  in  the  hierarchy 
being  a  cluster.  As  argued  in  [1 1]  [7]  [6]  [14],  in  addition  to 
helping  with  scalability  and  robustness,  aggregating  sensor 
nodes  into  clusters  has  additional  benefits: 

1)  conserving  radio  resources  such  as  bandwidth; 

2)  promoting  spatial  code  reuse  and  frequency  reuse; 

3)  simplifying  the  topology,  e.g.,  when  a  mobile  radar 
changes  its  location,  it  is  sufficient  for  only  the 
nodes  in  attended  clusters  to  update  their  topology 
information; 

4)  reducing  the  generation  and  propagation  of  routing 
information;  and, 

5)  concealing  the  details  of  global  network  topology 
from  individual  nodes. 

In  RSN,  each  radar  can  provide  their  waveform  parameters 
such  as  5i  to  their  clusterhead  radar,  and  the  clusterhead 
radar  can  combine  the  waveforms  from  its  cluster  mem¬ 
bers. 

In  RSN  with  M  radars,  the  received  signal  for  cluster- 
head  (assume  it’s  radar  1)  is 

M 

n  CM)  =  y^Q '.{u)xj(t  -  ti)exp(j2-KFDit)  +  n{u,t) 

i=i 

(13) 

where  a(u)  stands  for  radar  cross  section  (RCS)  and  can  be 
modeled  using  non-zero  constants  for  nonfluctuating  target 
and  four  Swerling  target  models  for  fluctuating  target]  16]; 
F^i  is  the  doppler  shift  of  target  relative  to  waveform  i\ 
ti  is  delay  of  waveform  i,  and  n(u,t)  is  additive  white 
Gaussian  noise  (AWGN).  In  this  paper,  we  propose  a 
RAKE  structure  for  waveform  diversity  combining,  as 
illustrated  by  Fig.  1. 

According  to  this  structure,  the  received  r\(u,t)  is 
processed  by  a  bank  of  matched  filters,  then  the  output 


Esm[iT(FDl  +  @t)(T  ■ 
Ttt(Fd1  +  0t) 


Fig.  1.  Waveform  diversity  combining  by  clusterhead  in  RSN. 


of  branch  1  (after  integration)  is 

, tj MjFdj,--  -  ,Fqm) | 

fT/2 

—  /  ri(u,  t)xl(t  —  ti)ds  (14) 

J-T/2 

/T/2  M 

[y'ai(ti)xi(t  -  ti)  exp{j2irFDit) 

T/2  i=1 

+n(u,t)}xl(t  —  ti)dt\  (15) 

where  jT!^,2n{u,t)x\(t  -  t\)dt  can  easily  be  proved  to 
be  AWGN,  let 

A  fT/2 

n(u,ti)  =  /  n{u,t)x\{t  -  t\)dt  (16) 

J-T/2 

follows  Rayleigh  distribution.  Assuming  1 1  =  1 2  =  ■  ■  ■ 
tM  =  t,  then  based  on  (12), 

\Zi(u;t,FDi  ,  •  •  •  ,FDm)\ 

m 

«  |  ^  a(u)E'sinc[7r(t  -  1  +  Fp.T)] 

+  aT">£sit^.+4T'M)1+"(“'^7) 

Similarly,  we  can  get  the  output  for  any  branch  m  (m  = 

1,2,...,M), 

\Zj n(u,  t,  Fd1  ,  •  •  •  ,FpM) | 


«  I  ^  aCu)-£'sinc[7r(*  —  m  +  FoiT)) 

So  \Zm(u-,T,  Fd1}  ■  ■  ■  ,Fdm)\  consists  of  three  parts, 
signal  (reflected  signal  from  radar  m  waveform): 

|  a(n)gsin[7r(Fpm+ffT)(T-|rl)j  interferences  from  other 

waveforms:  |a(w)Fsinc[7r(i  -  m  +  FDtT)}\, 

and  noise:  |n(u,r)|.  Delay-doppler  uncertainty  happens 
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quite  often  in  target  search  and  recognition  where  target 
range  and  velocity  are  not  yet  perfectly  known. 

We  can  also  have  three  special  cases  for 
Zm  (*Uj  T,  T'l) j  ,  ’  '  '  ,  l[ J m  )  I  • 

1)  When  FDl  =  •  •  •  =  FDm  =  0, 

|Zm(u;T,0,0,  •••  ,0)| 

«  i°(“)gHy -m)i+„(„iT)I(I9) 

2)  If  r  =  0,  then  (18)  becomes 

\Zm{u-,0,FDl,-  ■  ■  ,FDm)\ 

M 

«  |  ^  a(u)Esinc[7r(«  —  m  +  F^T)]  +  n((2jp) 

1=1 

3)  If  r  =  0,  and  FDl  =  •  •  •  =  FDm  =  0,  then  (18) 
becomes 

|Zm(u;  0,0,0,  •••  ,0)|  «  |  Ea(u)  +n(w)|  (21) 

How  to  combine  all  the  Zm's  (m  =  1, 2,  •  •  •  ,  M)  are 
very  similar  to  the  diversity  combining  in  communations 
to  combat  channel  fading,  and  the  combination  schemes 
may  be  different  for  different  applications.  In  this  paper, 
we  are  interested  in  applying  RSN  waveform  diversity 
to  automatic  target  recognition  (ATR),  e.g.,  recognition 
that  the  echo  on  a  radar  display  is  that  of  an  aircraft, 
ship,  motor  vehicle,  bird,  person,  rain,  chaff,  clear-air 
turbulence,  land  clutter,  sea  clutter,  bare  montains,  forested 
areas,  meteors,  aurora,  ionized  media,  or  other  natural 
phenomena.  Early  radars  were  “blob”  detectors  in  that  they 
detected  the  presence  of  a  target  and  gave  its  location 
in  range  and  angle,  and  radar  began  to  be  more  than  a 
blob  detector  and  could  provide  recognition  of  one  type 
of  target  from  another[18].  It  is  known  that  small  changes 
in  the  aspect  angle  of  complex  (multiple  scatter)  targets 
can  cause  major  changes  in  the  radar  cross  section  (RCS). 
This  has  been  considered  in  the  past  as  a  means  of  target 
recognition,  and  is  called  fluctuation  of  radar  cross  section 
with  aspect  angle,  but  it  has  not  had  much  success[18].  In 
this  paper,  we  propose  a  maximum  likelihood  automatic 
target  recognition  (ML-ATR)  algorithm  for  RSN.  We  will 
study  both  fluctuating  target  and  non-fluctuating  target. 

A.  ML-ATR  for  Fluctuating  Targets  with  Delay-Doppler 
Uncertainty 

Fluctuating  target  modeling  is  more  realistic  in  which 
the  target  RCS  is  drawn  from  either  the  Rayleigh  or 
chi-square  of  degree  four  pdf.  The  Rayleigh  model  de¬ 
scribes  the  behavior  of  a  complex  target  consisting  of 
many  scatters,  none  of  which  is  dominant.  The  fourth- 
degree  chi-square  models  targets  having  many  scatters 
of  similar  strength  with  one  dominant  scatter.  Based  on 


different  combinations  of  pdf  and  decorrelation  character¬ 
istics  (scan-to-scan  or  pulse-to-pulse  decorrelation),  four 
Swerling  models  are  used[16].  In  this  paper,  we  will  focus 
on  “Swerling  2”  model  which  is  Rayleigh  distribution  with 
pulse-to-pulse  decorrelation.  The  pulse-to-pulse  decorre¬ 
lation  implies  that  each  individual  pulse  results  in  an 
independent  value  for  RCS  a. 

For  Swerling  2  model,  the  RCS  |a(u)|  follows  Rayleigh 
distribution  and  its  I  and  Q  subchannels  follow  zero-mean 
Gaussian  distributions  with  variance  72.  Assume 

a{u)  =  ai(u)  +  jaQ{u)  (22) 

and  n(u)  —  ni(u)  +  jriQ(u)  follows  zero-mean  complex 
Gausian  distribution  with  variance  a2  for  the  I  and  Q  sub¬ 
channels.  Observe  (18),  for  given  r,  Fd,  (i  =  1,  •  •  ■  ,  M), 


a(u)Esmc[ir(i  —  m  +  FoiT)} 

a(u)E  sin[7r(JF£)m  +  /3r)(T  -  |r|)] 

Tn(FDm  +  pr) 

M 

=  »(«>£!  £  sinc[7r(i  -  m  +  FDiT)\ 

i=l,  i^m 

,  sin[7r(FDm  +/3t)(T-  |t|)]1 

Tn(FDm+Pr)  1  } 

follows  zero-mean  complex  Gaussian  distributions  with 
variance  SVCC^j,^  sinc[7r(z  -  m  +  FDiT )]  + 
S'n^T‘°(Fol3T+0T)^T^  ft  f°r  1  ant*  Q  subchannels.  Since 
n(u,T )  also  follows  zero-mean  Gaussian  distribution,  so 
\Zm(u; r,  FDl ,  •  '•  •  ,  FDm )|  of  (18)  follows  Rayleigh  distri¬ 
bution.  In  real  world,  the  perfect  values  of  r  and  Fot  are 
not  known  in  the  target  search  phase  and  the  mean  values 
of  r  and  Fd,  are  0,  so  we  just  assume  the  parameter  of 
this  Rayleigh  distribution  b  =  \J E2 ■y2  +  a2  (when  r  and 
Fd,  equal  to  0). 

Let  ym  =  \Zm(u;  t,FDi,--  ,  FDm)  |,  then 

f(ym}  =  £272  +  a2  eXP(~£272m+(T2)  (24) 

The  mean  value  of  ym  is  yj- anti  variance  is 
(4— ir)(£  7  varjance  0f  sjgnal  js  (4-x)£  7  an(j 

the  variance  of  noise  is  _ 


Let  y  =  [yi,  V2,  •  ■  •  ,  Vm],  then  the  pdf  of  y  is 


M 

/( y)  =  n  tty™)  (25) 

m= 1 

Our  ATR  is  a  multiple-category  hypothesis  testing  prob¬ 
lem,  i.e.,  to  decide  a  target  category  (e.g.  different  aircraft, 
motor  vehicle,  etc)  based  on  r\{u,t).  Assume  there  are 
totally  N  categories  and  category  n  target  has  RCS  an(u) 


4 


(with  variance  72),  so  the  ML-ATR  algorithm  to  decide  a 
target  category  C  can  be  expressed  as, 


C  =  arg  max  /(y|7  =  7n) 
n= ,N 


(26) 


M  2 


The  ML-ATR  algorithm  to  decide  a  target  category  C 
based  on  y  can  be  expressed  as, 


C  =  arg  max  /(y|A  =  E\an\) 

n— 1,—  ,N 

n^Um  \  {Vm  -®2^n)ir  /  2 £7 1 Oiyx 1 2/m  ^ 

—  exp[ - ^ - ]Io(  —2  ) 

m— 1 


5.  ML-ATR  for  Non-fluctuating  Targets  with  Delay- 
Doppler  Uncertainty 

In  some  sources,  the  non-fluctuating  target  is  identified 
as  “Swerling  0”  or  “Swerling  5”  model[19].  For  non¬ 
fluctuating  target,  the  RCS  a(u)  is  just  a  constant  a 
for  a  given  target.  Observe  (18),  for  given  r,  Fpi  (i  = 
V",M), 


V.  Simulations 

Radar  sensor  networks  will  be  required  to  detect  a  broad 
range  of  target  classes.  In  this  paper,  we  applied  our  ML- 
ATR  to  automatic  target  recognition  with  delay-doppler 
uncertainty.  We  assume  that  the  domain  of  target  classes 
is  known  a  priori  ( N  in  Sections  IV-A  and  IV-B),  and  that 
the  RSN  is  confined  to  work  only  on  the  known  domain. 


M 


+ 


T;  ct(u)Esinc[ir(i  —  m  +  FoiT)] 
a(u)Esin[Tt(FDm  +  p r)(T  -  |r|)j 


Tn(FDm  +  pr) 


(27) 


M 


=  qE[  ^  sinc[7r(i  -  m  +  FD.T)] 

i=l,  ij£m 

sin[7r(FDm  +  /3r)(T  -  |r|)] 
Tit{FDrn  +  Pt)  1 


(28) 


is  just  a  constant.  Since  n(u,r)  follows  zero-mean 
Gaussian  distribution,  so  \Zm(um,  r,  •  •  •  ,  Fdm)\  of 
(18)  follows  Rician  distribution  with  direct  path  value 

M 

A  =  aE{  ^2  sinc[7r(i  —  m  +  F^T)] 

i=l 

,  sink  (FDm  +  Pt){T  —  |t|)]  . 

T7r(FDm+/0r)  J 

Since  r  and  Fpi  are  uncertain  and  zero-mean,  so  we  just 
use  the  approximation 


A  =  aE 


(30) 


which  is  obtained  when  r  and  FD.  equal  to  0. 

Let  ym  =  | Zm(u\T,FDl,---  ,FDm) |,  then  the  proba¬ 
bility  density  function  (pdf)  of  ym  is 


f(Vm) 


^exp 

a *  a 1  a2. 


)  (3D 


where  a 2  is  the  noise  power  (with  I  and  Q  sub-channel 
power  <r2/2),  and  70(-)  is  the  zero-order  modified  Bessel 

function  of  the  first  kind.  Let  y  =  [yi,j/2,-  •  •  ,2/m],  then 
the  pdf  of  y  is 


M 

/( y)  =  n  /(»"*)  (32) 

m=l 


TABLE  I 

RCS  VALUES  AT  MICROWAVE  FREQUENCY  FOR  6  TARGETS. 


Index  n 

Target 

RCS 

1 

Small  single-engine  aircraft 

1 

2 

Large  flighter  aircraft 

6 

3 

Medium  bomber  or  jet  airliner 

20 

4 

Large  bomber  or  jet  airliner 

40 

5 

Jumbo  jet 

100 

6 

Pickup  truck 

200 

For  fluctuating  target  recognition,  our  targets  have  6 
classes  with  different  RCS  values,  which  are  summarized 
in  Table  I[1 8].  We  assume  the  fluctuating  targets  follow 
“Swerling  2”  model  (Rayleigh  with  pulse-to-pulse  decor¬ 
relation),  and  assume  the  RCS  value  listed  in  Table  I  to  be 
the  standard  deviation  (std)  7„  of  RCS  a„(u)  for  target  n. 
We  applied  the  ML-ATR  algorithm  in  Section  IV-A  (for 
fluctuating  target  case)  for  target  recognition  within  the  six 
targets  domain.  We  chose  T  =  0.1ms  and  P  =  106.  At 
each  average  SNR  value,  we  ran  Monte-Carlo  simulations 
for  105  times  for  each  target.  In  Fig.  2(a)(b)(c),  we  plot 
the  average  ATR  error  for  fluctuating  targets  with  different 
delay-doppler  uncertainty  and  compared  the  performances 
of  single  radar  system,  5-radar  RSN,  and  10-radar  RSN. 
Observe  these  three  figures: 

1)  The  two  RSNs  vastly  reduce  the  ATR  error  com¬ 
paring  to  a  single  radar  system  in  ATR  with  delay- 
doppler  uncertainty,  e.g.,  the  10-radar  RSN  can 
achieve  ATR  error  2%  comparing  against  the  single 
radar  system  with  ATR  error  37%  at  SNR  =  32clB 
with  delay-doppler  uncertainty  r  £  [— 0.1T,  0.1T] 
and  Foi  £  [—200772,200772], 

2)  Our  LFM  waveform  design  can  tolerate  reasonable 
delay-doppler  uncertainty  which  are  testified  by  Fig. 
2(b)(c). 

3)  According  to  Skolnik[18],  radar  performance  with 
probability  of  recognition  error  (pe )  less  than  10% 
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is  good  enough.  Our  10-radar  RSN  with  waveform- 
diversity  can  have  probability  of  ATR  error  much 
less  than  10%  for  the  average  ATR  for  all  targets. 
However,  the  single  radar  system  has  probability  of 
ATR  error  much  higher  than  10%.  Our  RSN  with 
waveform  diversity  is  very  promising  to  be  used  for 
real-world  ATR. 

4)  Observe  Fig.  2(a)(c),  the  average  probability  of  ATR 
error  in  2(c)  is  not  as  sensitive  to  the  SNR  as  that 
in  2(a),  i.e.,  ATR  error  curve  slope  becomes  flat 
with  higher  delay-doppler  uncertainty,  which  means 
the  delay-doppler  uncertainty  can  dominate  the  ATR 
performance  when  it’s  too  high. 

For  non-fluctuating  target  recognition,  our  targets  have  6 
classes  with  different  RCS  values,  which  are  summarized 
in  Table  I[  1 8] .  We  applied  the  ML- ATR  algorithms  in 
Section  IV-B  (for  nonfluctuating  target  case)  to  classify  an 
unknown  target  as  one  of  these  6  target  classes.  We  chose 
T  =  0.1ms  and  (3  =  106.  At  each  average  SNR  value,  we 
ran  Monte-Carlo  simulations  for  105  times  for  each  target. 
In  Fig.  3(a)(b)(c),  we  plotted  the  probability  of  ATR  error 
with  different  delay-doppler  uncertainty.  Observe  these 
figures, 

1)  The  two  RSNs  tremendously  reduce  the  ATR  error 
comparing  to  a  single  radar  system  in  ATR  with 
delay-doppler  uncertainty,  e.g.,  the  10-radar  RSN  can 
achieve  ATR  error  9%  comparing  against  the  single 
radar  system  with  ATR  error  22%  at  SNR  =  22 dB 
with  delay-doppler  uncertainty  r  €  [-0.2T,0.2T] 
and  FDi  E  [-500H*,  500Hz], 

2)  Comparing  Fig.  2(a)(b)(c)  against  Fig.  3(a)(b)(c),  the 
gain  of  10-radar  RSN  for  fluctuating  target  recog¬ 
nition  is  much  larger  than  that  for  non-fluctuating 
target  recognition,  which  means  our  RSN  has  better 
capacity  to  handle  the  fluctuating  targets.  In  real 
world,  fluctuating  targets  are  more  meaningful  and 
realistic. 

3)  Comparing  Fig.  3(a)(b)(c)  against  Fig.  2(a)(b)(c), 
the  ATR  needs  much  lower  SNR  for  nonfluctuating 
target  recognition  because  Rician  distribution  has 
direct  path  component. 

VI.  Conclusions  and  Future  Works 

We  have  studied  LFM  waveform  design  and  diversity  in 
radar  sensor  networks  (RSN).  We  showed  that  the  LFM 
waveforms  can  co-exist  if  the  carrier  frequency  spacing 
is  1/T  between  two  radar  waveforms.  We  made  analysis 
on  interferences  among  waveforms  in  RSN  and  proposed  a 
RAKE  structure  for  waveform  diversity  combining  in  RSN. 
We  applied  the  RSN  to  automatic  target  recognition  (ATR) 
with  delay-doppler  uncertainty  and  proposed  maximum- 
likehood  (ML)-ATR  algorithms  for  fluctuating  target  and 
non-fluctuating  target.  Simulation  results  show  that  RSN 


(b) 


Fig.  2.  The  average  probability  of  ATR  error  for  6  fluctuating 
targets  with  different  delay-doppler  uncertainty,  (a)  no  delay-doppler 
uncertainty,  (b)  with  delay-doppler  uncertainty,  r  £  [— 0.1T,  0.1T]  and 
Fpi  £  [—200 Hz,  200 Hz],  and  (c)  with  delay-doppler  uncertainty, 
r  £  [— 0.2T,  0.2T]  and  FDi  £  [~500Hz,500Hz], 


using  our  waveform  diversity-based  ML-ATR  algorithm 
performs  much  better  than  single  radar  system  for  fluc¬ 
tuating  targets  and  nonfluctuating  targets  recognition.  It 
is  also  demonstrated  that  our  LFM  waveform-based  RSN 
can  handle  the  delay-doppler  uncertainty  which  quite  often 
happens  for  ATR  in  target  search  phase. 

In  this  paper,  the  ATR  is  for  single  target  recognition. 
We  will  continuously  investigate  the  ATR  when  multiple 
targets  co-existence  in  RSN  and  each  target  has  delay- 
doppler  uncertainty.  In  our  waveform  diversity  combining, 
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Fig.  3.  The  average  probability  of  ATR  error  for  6  nonfluctuating 
targets  with  different  delay-doppler  uncertainty,  (a)  no  delay-doppler 
uncertainty,  (b)  with  delay-doppler  uncertainty,  r  £  [— 0.1T,  0.1T]  and 
Fni  6  [— 200-f/z,  20077  z],  and  (c)  with  delay-doppler  uncertainty, 
r  e  [— 0.2T,  0.2T]  and  FDi  e  [-50077z,  500/7z]. 

we  have  used  spatial  diversity  combining  in  this  paper.  We 
will  further  investigate  spatial-temporal-frequency  combin¬ 
ing  for  RSN  waveform  diversity. 
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Abstract — In  this  paper,  we  propose  a  Knowledge-based 
Ubiquitous  and  Persistent  Sensor  networks  (KUPS)  for  threat 
assessment,  of  which  “sensor”  is  a  broad  characterization 
concept.  It  means  diverse  data  or  information  from  ubiqui¬ 
tous  and  persistent  sensor  sources  such  as  organic  sensors  and 
human  intelligence  sensors.  Our  KUPS  for  threat  assessment 
consists  of  two  major  steps:  threat  detection  using  fuzzy  logic 
systems  and  threat  parameter  estimation  using  radar  sensor 
networks.  Our  fuzzy  logic  systems  can  combine  the  linguistic 
knowledge  from  different  intelligent  sensors.  We  propose  a 
maximum-likelihood  (ML)  estimation  algorithm  for  target 
RCS  parameter  estimation,  and  we  show  that  our  ML  esti¬ 
mator  is  unbiased  and  the  variance  of  parameter  estimation 
matches  the  Cramer-Rao  lower  bound.  Simulations  further 
validate  these  theoretical  results. 

I.  Introduction  and  Motivation 

In  current  and  future  military  operational  environments, 
such  as  Global  War  on  Terrorism  (GWOT)  and  Maritime 
Domain  Awareness  (MDA),  warfighters  require  technolo¬ 
gies  evolved  to  support  information  needs  regardless  of 
location  and  consistent  with  the  users  level  of  command 
or  responsibility  and  operational  situation.  To  support  this 
need,  the  DoD  has  developed  the  concept  of  Network 
Centric  Warfare  (NCW),  defined  as  “ military  operations 
that  exploit  state-of-the-art  information  and  networking 
technology  to  integrate  widely  dispersed  human  decision 
makers,  situational  and  targeting  sensors,  and  forces  and 
weapons  into  a  highly  adaptive,  comprehensive  system  to 
achieve  unprecedented  mission  effectiveness .” 

Some  works  have  been  reported  on  threat  assessment.  In 
[5],  an  intelligent  threat  assessment  processor  using  genetic 
algorithms  and  fuzzy  logic  was  proposed.  In  [16],  threat 
assessment  was  studied  in  tactical  airborne  environments. 
In  [8],  neural  network  was  applied  to  threat  assessment 
for  automated  visual  surveillance.  In  [4],  an  intelligent 
assistant  to  provide  automatic  situation  and  threat  advice 
in  the  Air  Defence  Ground  Environment  was  proposed. 
In  [1],  a  situation  and  threat  assessment  model  based  on 


group  analysis  was  proposed.  A  situation/threat  assessment 
fusion  system  was  proposed  in  [2],  Other  approaches 
include  multiple  attribute  decision  making[3],  bayesian 
networks[17],  etc. 

Current  shortfalls  in  warfighting  functionality  result 
from  limitations  in  technology.  For  example,  accurate  and 
timely  information  about  battlespace  objects  and  events 
is  not  available  to  support  warfighter  decision  making, 
(includes  reliable  location,  tracking,  combat  identification, 
and  targeting  information).  Too  often,  the  characteristics  of 
objects  that  are  not  of  interest  will  be  similar  to  those  of 
threat  objects.  The  conventional  approach  to  false  alarm 
control  is  to  reduce  sensitivity  of  the  radar  in  areas  of 
clutter,  using  Sensitivity  Time  Control  (STC)  [21],  How¬ 
ever,  this  approach  is  not  effective  for  small  and  slow 
objects.  In  [9],  we  proposed  a  maximum-likelihood  (ML) 
automatic  target  recognition  (ATR)  algorithm  using  con¬ 
stant  frequency  (CF)  waveform  design  and  diversity,  and 
perfect  delay-doppler  are  assumed  in  ATR.  In  [10],  we  ap¬ 
plied  linear  frequency  modulation  (LFM)  waveform  design 
and  diversity  to  ATR  with  delay-doppler  uncertainty.  The 
above  two  approaches  assumed  that  different  targets  in  the 
domain-of-classification  have  different  radar  cross  section 
(RCS),  e.g.,  bird  has  RCS  value  0.01m2  and  conventional 
unmanned  winged  missle  has  RCS  value  0.5 m2.  However, 
different  targets  may  have  the  same  RCS  values,  e.g.,  a  man 
and  a  small  single-engine  aircraft  have  the  same  RCS  value 
lm2  at  microwave  frequencies  [21].  Besides,  different 
threat  targets,  such  as  missile  and  large  aircraft,  may  need 
different  combat  action,  so  we  have  to  classify  the  threat 
target  type.  In  this  paper,  we  propose  Knowledge-based 
Ubiquitous  and  Persistent  Sensor  networks  (KUPS)  for 
Threat  Assessment. 

The  rest  of  this  paper  is  organized  as  follows.  In 
Section  II,  we  introduce  a  new  concept,  Knowledge-based 
Ubiquitous  and  Persistent  Sensor  networks  (KUPS);  in 
Section  III,  we  propose  Situation-Aware  Knowledge-Based 
Threat  Assessment  Using  INT  Sensors;  in  Section  IV,  we 


propose  a  maximum-likelihood  estimation  algorithm  for 
threat  target  RCS  parameter  estimation  using  radar  sensor 
network  and  provide  Monte  Carlo  simulation  results.  In 
Section  V,  we  conclude  this  paper  and  discuss  future 
research. 

II.  Introduction  to  Knowledge-based 
Ubiquitous  and  Persistent  Sensor  networks 
(KUPS):  A  New  Concept 

In  this  paper,  we  propose  a  Network  Centric  Warfare 
model,  Knowledge-based  Ubiquitous  and  Persistent  Sensor 
networks  (KUPS),  of  which  “sensor”  is  a  broad  character¬ 
ization  concept.  It  means  diverse  data  or  information  from 
ubiquitous  and  persistent  sensor  sources  such  as 

•  Organic  sensors  (e.g.,  radar,  electro-optic  and  infrared, 
acoustic,  and  non-acoustic)  deployed  on  air,  ground, 
surface,  or  unattended  platforms. 

•  Signals  Intelligence  (SIGINT)  including  Electronic 
Intelligence  (ELINT)  and  Communication  Intelli¬ 
gence  (COMINT),  for  example,  it  can  assign  mean¬ 
ingful  metadata  to  each  collection,  and  metadata  is 
the  standardized  characterization  of  data  providing 
descriptors  (such  as  stability,  activity,  membership,  or 
structure). 

•  Human  Intelligence  (HUMINT),  e.g.,  to  identify  spe¬ 
cific  people/cells/groups  and  relationships. 

•  Measurement  and  Signatures  Intelligence  (MASINT), 
e.g.,  to  provide  specific  weapon  system  identifica¬ 
tions,  chemical  compositions  and  material  content. 

.  Imagery  Intelligence  (IMINT),  e.g.,  to  track  vehicles 
through  urban  area. 

•  Open  Source  Intelligence  (OSINT),  e.g.,  to  provide 
text  data  collection. 

All  these  sources  of  information  need  to  be  integrated  via 
“sensor  networking”  to  accomplish  a  mission.  In  this  paper, 
we  apply  KUPS  to  threat  assessment,  and  the  organic 
sensors  we  use  are  pulse  doppler  radars. 

Our  KUPS  for  threat  assessment  is  a  hierarchical  archi¬ 
tecture  which  consists  of  two  major  steps: 

1)  Step  1:  Perform  situation-aware  knowledge-based 
threat  assessment  using  INT  sensors  (e.g.  SIGINT, 
HUMINT  sensors).  Fuzzy  rules  are  used  to  rep¬ 
resent  the  linguistic  knowledge  uncertainties  from 
HUMINT  sensors,  and  fuzzy  logic  systems  are  used 
to  perform  knowledge-based  decision  making  on 
threat  assessment  (e.g.,  threat  or  non-threat),  If  it  is 
assessed  as  a  non-threat,  then  stops;  if  it  is  assessed 
as  a  threat,  then  go  to  the  next  step  to  classify  what 
kind  of  target  this  threat  is. 

2)  Step  2:  Perform  target  RCS  value  estimation  us¬ 
ing  radar  sensor  networks.  We  propose  a  maximal- 
likelihood  (ML)  estimation  algorithm  to  estimate 


target  RCS  parameter  value  using  radar  sensor  net¬ 
works.  Based  on  the  estimated  RCS  parameter,  the 
KUPS  will  advise  what  kind  of  target  this  threat 
is.  The  ML  estimation  algorithm  can  help  to  esti¬ 
mate  the  RCS  parameter  9  (parameter  in  Rayleigh 
distribution  for  fluctuating  target).  However,  same 
RCS  parameter  may  mean  different  targets,  threat 
or  non-threat.  For  example,  for  9  =  2,  the  target  can 
be  a  small  flighter  aircraft,  a  small  pleasure  boat,  a 
bicycle[21],  or  any  other  similar  size  target.  So  we 
have  to  use  Step  1  to  make  decision  first,  and  only 
threat  target  requires  further  classification  for  further 
action. 

We  will  discuss  these  two  steps  in  the  following  sections. 

III.  Situation-Aware  Knowledge-Based  Threat 
Assessment  Using  INT  Sensors 
In  knowledge-based  threat  assessment  using  INT  sen¬ 
sors,  fuzzy  rules  are  used  to  represent  the  linguistic  and 
numerical  knowledge  uncertainties  from  INT  sensors,  and 
fuzzy  logic  systems  are  used  to  perform  knowledge-based 
decision  making  on  threat  assessment.  We  give  a  brief 
introduction  on  fuzzy  logic  systems  first. 


A.  Overview  of  Fuzzy  Logic  Systems 

Figure  1  shows  the  structure  of  a  fuzzy  logic  system 
(FLS)  [12],  When  an  input  is  applied  to  a  FLS,  the 
inference  engine  computes  the  output  set  corresponding 
to  each  rule.  The  defuzzifer  then  computes  a  crisp  output 
from  these  rule  output  sets.  Consider  a  p-input  1-output 
FLS,  using  singleton  fuzzification,  center-of-sets  defuzzi¬ 
fication  [14]  and  “IF-THEN”  rules  of  the  form 

Rl  :  IF  x\  is  F'j  and  x2  is  Fz2  and  •  •  •  and  xp  is  flp, 
THEN  p  is  Gz. 


Assuming  singleton  fuzzification,  when  an  input  x'  = 
{x[, . . .  ,x'v}  is  applied,  the  degree  of  firing  corresponding 
to  the  l\h  rule  is  computed  as 


where  a  and  T  both  indicate  the  chosen  t-norm.  There  are 
many  kinds  of  defuzzifiers.  In  this  paper,  we  focus,  for  il¬ 
lustrative  purposes,  on  the  center-of-sets  defuzzifier  [14].  It 
computes  a  crisp  output  for  the  FLS  by  first  computing  the 
centroid,  cG; ,  of  every  consequent  set  Gl,  and,  then  com¬ 
puting  a  weighted  average  of  these  centroids.  The  weight 
corresponding  to  the  fth  rule  consequent  centroid  is  the 
degree  of  firing  associated  with  the  Zth  rule,  {x'fj, 

so  that 


1/cos  (x )  — 


E/=i  V=xtR\(x'i) 


(2) 


where  M  is  the  number  of  rules  in  the  FLS. 
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Fig.  1.  The  structure  of  a  fuzzy  logic  system. 


and  High.  The  consequent  -  the  possibility  that  this  target 
is  a  threat  -  was  divided  into  5  levels,  Very  Strong,  Strong, 
Medium,  Weak,  Very  Weak.  So  we  need  to  set  up  3 3  =  27 
(because  every  antecedent  has  3  fuzzy  sub-sets,  and  there 
are  3  antecedents)  rules  for  this  FLS.  Table  I  summarizes 
the  fuzzy  rules  we  used  in  this  paper.  We  used  trapezoidal 
membership  functions  (MFs)  to  represent  low,  and  high, 
and  triangle  MFs  to  represent  moderate.  We  show  these 
MFs  in  Fig.  2. 


B.  Situation-aware  Knowledge-based  Threat  Assessment 
Using  Fuzzy  Logic  Systems 

In  our  FLS  design  for  threat  assessment,  we  will  con¬ 
sider  the  following  knowledge-based  antecedents: 

1)  The  number  of  switches  from  non-maneuvering  set 
to  the  maneuvering  set.  When  a  target  is  begin¬ 
ning  a  maneuver  from  a  non-maneuvering  class,  the 
tracking  system  can  switch  the  algorithms  applied 
to  the  problem  from  a  non-maneuvering  set  to  the 
maneuvering  set.  The  errors  in  distance  from  where 
the  tracker  estimates  the  position  of  a  target  and 
the  actual  position  can  be  very  large  when  the 
incorrect  motion  models  are  applied  to  the  problem. 
Additionally,  when  the  tracker  does  finally  catch  up 
to  the  target  after  the  maneuver,  the  track  will  ’’jump” 
across  the  operator’s  scope  giving  a  very  unrealistic 
and  unreliable  picture  of  what  that  target  is  actually 
doing.  So  a  threat  target  will  quite  often  switch  from 
a  non-maneuvering  set  to  the  maneuvering  set,  and 
vice  versa,  to  avoid  being  tracked  all  the  time.  This 
knowledge  can  be  used  as  an  antecedent  to  make 
threat  assessment. 

2)  The  frequency  of  appearance  of  such  type  of  target 
based  on  some  a  priori  knowledge  such  as  archival 
radar  data.  Generally  threat  targets  are  new  compar¬ 
ing  to  archival  radar  data. 

3)  The  importance  of  geolocation  of  this  target  based 
on  the  geographical  information  systems  (GISs). 
Examples  of  important  geolocations  include  large 
metroplex,  landmarks,  military  bases,  airport,  etc. 
Threats  happen  quite  often  in  such  areas. 

The  above  three  antecedents  are  all  knowledge-based 
and  it  can  be  collected  from  the  INT  sensors.  A  typical 
rule  using  the  above  three  antecedents  can  be 

IF  the  number  of  switches  from  non-maneuvering  set  to 
the  maneuvering  set  is  High,  and  the  frequency  of 
appearance  of  such  target  is  Low,  and  the  importance  of 
geolocation  of  such  type  of  target  is  High,  THEN  the 
possibility  that  this  target  is  a  threat  is  Very  Strong. 

The  linguistic  variables  used  to  represent  each  an¬ 
tecedent  were  divided  into  three  levels:  Low,  Moderate, 


TABLE  I 

Fuzzy  rules  used  in  the  KUPS.  Ante  1  is  the  number  of 

SWITCHES  FROM  NON-MANEUVERING  SET  TO  THE  MANEUVERING 
SET  OR  VICE  VERSA;  ANTE  2  THE  FREQUENCY  OF  APPEARANCE  OF 
SUCH  TYPE  OF  TARGET;  ANTE  3  IS  THE  IMPORTANCE  OF 
GEOLOCATION  OF  THIS  TARGET;  AND  CONSEQUENT  IS  THE 
POSSIBILITY  THAT  THIS  TARGET  IS  A  THREAT. 


Rule  # 

Ante  1 

Ante  2 

Ante  3 

Consequent 

1 

low 

low 

low 

Weak 

2 

low 

low 

moderate 

Medium 

3 

low 

low 

high 

Strong 

4 

low 

moderate 

low 

Very  Weak 

5 

low 

moderate 

moderate 

Weak 

6 

low 

moderate 

high 

Medium 

7 

low 

high 

low 

Very  Weak 

8 

low 

high 

moderate 

Weak 

9 

low 

high 

high 

Medium 

10 

moderate 

low 

low 

s^U^nSESii! 

11 

moderate 

low 

moderate 

Strong 

12 

moderate 

low 

high 

Very  Strong 

13 

moderate 

moderate 

low 

Weak 

14 

moderate 

moderate 

moderate 

Medium 

15 

moderate 

moderate 

high 

Strong 

16 

moderate 

high 

low 

Very  Weak 

17 

moderate 

high 

moderate 

Weak 

18 

moderate 

high 

Medium 

19 

high 

low 

low 

Medium 

20 

high 

low 

Strong 

21 

high 

low 

high 

Very  Strong 

22 

moderate 

low 

Weak 

23 

high 

moderate 

moderate 

Medium 

24 

high 

moderate 

high 

Strong 

25 

high 

high 

low 

Very  Weak 

26 

high 

high 

moderate 

Weak 

27 

high 

Moderate 

For  every  input  (xi , £2, 23),  the  output  is  computed 
using 


y(xi,x2,x3) 


Tali  )Vr>{x2)VF?{x3)clavg 


By  repeating  these  calculations  for  V.t;  e  [0, 10],  we 
obtain  a  hypersurface  y(xi,  x2,  £3).  Since  it’s  a  4-D  surface 
(xi,x2,xs,y),  it’s  impossible  to  be  plot  visually. 

If  we  have  X3  =  1,  and  two  other  antecedents,  x\  and 
x2  are  variables,  for  every  input  (x\,x2,l),  the  output  is 
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Fig.  2.  The  MFs  used  to  represent  the  linguistic  labels,  (a)  MFs  for 
antecedents,  and  (b)  MFs  for  consequent. 


computed  using 


y{xx,x2,l) 


Efll  MFifol)MF?(z2)MF;'(l)4os 


(4) 


By  repeating  these  calculations  for  Vxi  <E  [0,10]  and 
Vrc2  €  [0,10]  ,  we  obtain  a  hypersurface  y{x\,x 2,1),  as 
plotted  in  Fig.  3(a).  In  contrast,  if  we  have  23  =  9,  and  two 
other  antecedents,  x\  and  22  are  variables,  similarly  we 
obtain  another  surface  3/(11,  x2, 9),  as  plotted  in  Fig.  3(b). 
Observe  Fig.  3ab,  the  importance  of  geolocation  of  a  target 
(x3)  makes  a  big  difference  in  threat  assessment,  and 
the  number  of  switches  from  non-maneuvering  set  to  the 
maneuvering  set  or  vice  versa  (21)  and  the  frequency  of 
appearance  of  such  target  (22)  also  play  very  important 
role  even  when  the  importance  of  geolocation  (2  3)  is  the 
same. 


(a) 


(b) 


Fig.  3.  The  threat  assessment  surface  for  fixed  importance  of  geolocation 
of  this  target  (X3),  (a)  when  x3  =  1,  and  (b)  when  13  =  9. 


IV.  Target  RCS  Value  Estimation  Using  Radar 
Sensor  Networks 

A.  RCS  and  RCS  Voltage  for  Fluctuating  Target 

Most  radar  analysis  and  measurement  programs  em¬ 
phasize  RCS  measurements,  which  are  proportional  to 
received  power.  RCS  is  the  fictional  area  over  which  the 
transmitter  power  density  must  be  intercepted  to  collect 
a  total  power  that  would  account  for  the  received  power 
density.  Typical  values  of  RCS  for  targets  of  interests  range 
from  0.01m2  to  hundreds  of  square  meters[21].  Fluctuating 
target  modeling  is  more  realistic  in  which  the  target  RCS 
is  drawn  from  either  the  Rayleigh/exponential  or  chi- 
square  of  degree  four  pdf.  The  Rayleigh/exponential  model 
describes  the  behavior  of  a  complex  target  consisting  of 
many  scatters,  none  of  which  is  dominant.  The  fourth- 
degree  chi-square  models  targets  having  many  scatters 
of  similar  strength  with  one  dominant  scatter.  Based  on 
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different  combinations  of  pdf  and  decorrelation  character¬ 
istics  (scan-to-scan  or  pulse-to-pulse  decorrelation),  four 
Swerling  models  are  used[20].  In  this  paper,  we  will  focus 
on  “Swerling  2”  model  which  is  exponential  distribu¬ 
tion  with  pulse-to-pulse  decorrelation.  The  pulse-to-pulse 
decorrelation  implies  that  each  individual  pulse  results  in 
an  independent  value  for  RCS.  Sometime  the  RCS  voltage 
value  (square  root  of  RCS),  is  of  interest,  particularly  for 
use  in  simulations  to  model  the  composite  echo  from  a 
multiple  scatterer  target.  The  RCS  voltage  value  is  the 
square  root  of  RCS,  so  the  probability  density  function 
of  RCS  voltage  follows  Rayleigh  distribution[20].  In  this 
paper,  we  apply  radar  sensor  network  to  estimate  the  RCS 
value. 


B.  Introduction  to  Radar  Sensor  Networks 

In  [9],  we  performed  some  theoretical  studies  on  con¬ 
stant  frequency  (CF)  pulse  waveform  design  and  diversity 
in  radar  sensor  networks  (RSN):  (1)  the  conditions  for 
waveform  co-existence,  (2)  interferences  among  wave¬ 
forms  in  RSN,  (3)  waveform  diversity  combining  in  RSN. 

For  radar  sensor  networks,  the  waveforms  from  differ¬ 
ent  radars  will  interfere  with  each  other.  We  choose  the 
waveform  for  radar  i  as 

Xi{t)  =  ^exp[j27r(/3  +  <5;)f]  -  T/2  <  t  <  T/2  (5) 

which  means  there  is  a  frequency  shift  <5;  for  radar  i.  To 
minimize  the  interference  from  one  waveform  to  the  other, 
optimal  values  for  <5;  should  be  determined  to  have  the 
waveforms  orthogonal  to  each  other,  i.e.,  let  the  cross¬ 
correlation  between  Xi(t)  and  xn(t)  be  0,  we  showed  that 
choosing  <5;  =  ^  in  (5)  can  have  orthogonal  waveforms, 
i.e.,  the  waveforms  can  co-exist  if  the  carrier  spacing  is 
l/T  between  two  radar  waveforms. 

In  RSN,  The  radar  sensors  are  networked  together  in  an 
ad  hoc  fashion.  They  do  not  rely  on  a  preexisting  fixed 
infrastructure,  such  as  a  wireline  backbone  network  or 
a  base  station.  They  are  self-organizing  entities  that  are 
deployed  on  demand  in  support  of  various  events  surveil¬ 
lance,  battlefield,  disaster  relief,  search  and  rescue,  etc. 
Scalability  concern  suggest  a  hierarchical  organization  of 
radar  sensor  networks  with  the  lowest  level  in  the  hierarchy 
being  a  cluster.  As  argued  in  [1 1]  [7]  [6]  [18],  in  addition  to 
helping  with  scalability  and  robustness,  aggregating  sensor 
nodes  into  clusters  has  additional  benefits: 

1)  conserving  radio  resources  such  as  bandwidth; 

2)  promoting  spatial  code  reuse  and  frequency  reuse; 

3)  simplifying  the  topology,  e.g.,  when  a  mobile  radar 
changes  its  location,  it  is  sufficient  for  only  the 
nodes  in  attended  clusters  to  update  their  topology 
information; 


4)  reducing  the  generation  and  propagation  of  routing 
information;  and, 

5)  concealing  the  details  of  global  network  topology 
from  individual  nodes. 

In  RSN,  each  radar  can  provide  their  waveform  parameters 
such  as  5 i  to  their  clusterhead  radar,  and  the  clusterhead 
radar  can  combine  the  waveforms  from  its  cluster  mem¬ 
bers. 

In  RSN  with  M  radars,  the  received  signal  for  cluster- 
head  (assume  it’s  radar  1)  is 

M 

n(u,t)  =  y^  a(u)xi(t  -  tj)  exp(j2nFDit)  +  n(u,t)  (6) 

i=l 

where  Xi(t)  is  the  transmitted  CF  waveform;  a(u)  stands 
for  voltage  of  radar  cross  section  (RCS);  Fjji  is  the 
doppler  shift  of  target  relative  to  waveform  i;  fy  is  delay 
of  waveform  i,  and  n(u,  t )  is  additive  white  Gaussian 
noise  (AWGN).  In  [9],  we  proposed  a  RAKE  structure  for 
waveform  diversity  combining,  as  illustrated  by  Fig.  4. 


Fig.  4.  Waveform  diversity  combining  by  clusterhead  in  RSN. 


According  to  this  structure,  the  received  n(ii,t)  is 
processed  by  a  bank  of  matched  filters,  then  the  output 
of  branch  1  (after  integration)  is[9] 


Zi(u\tu-  ■  ■  ,tMlFol ,  ■  •  •  ,Fdm) | 

/•T/2 

/  ri(u,  t)x\{t  —  t\)ds  (7) 

J-T/2 

/T/2  M 

\S^ai{u)xi{t  -  ti)exY>(j2nFDit) 

■t/2  4=1 

+n(u,t)]xl(t  -  ti)dt\  (8) 

where  fT^,2 n(u, f)x[(f  —  t\)dt  can  easily  be  proved  to 
be  AWGN,  so 


KMi)|  = 


,T/2 

/  n(u,  t)x\{t  —  t\)dt 
J-T/2 


(9) 
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follows  Rayleigh  distribution.  Assuming  fx  =  f2  =  •  •  •  = 
tM  =  t,  then  according  to  interference  analysis  in  [9], 


\Zi{u\t,FDi,---  ,FDa 


M 


T,  |a(u)sinc[7r(?  -  1  +  FDiT)]\ 


i= 2 


+ 


a(u)  sm[nFDl(T  -  |r|)] 


TttFDi 


+  |n(u,r)|  (10) 


Similarly,  we  can  get  the  output  for  any  branch  m  (m  = 

1  >  2,  ■  •  -  ,  M), 


+ 


|2'm(u;r,FD1,--'  ,FDm)| 
m 

y\  |a(u)sinc[7r(r  —  m  +  F/^T)]! 
a(u)  sin[7rFDm  (T  -  |r|)] 


TnFD 


+  |n(u,r)|  (11) 


So  \Zm(u\T,  Fp1,  ■  ■  ■  ,Fdm)\  consists  of  three  parts, 
signal  (reflected  signal  from  radar  m  waveform): 
a(u)gsin;TFpm(T-|r|)]  ,  interferences  from  other 

waveforms:  Yiix^m  |a(«)-Bsinc[7r(*  -m  +  FDiT)}\, 

and  noise:  |n(u,r)|. 

We  can  have  three  special  cases  for 
| Zm{u-,T,FDl,  -  ■  ■  ,FDm) |: 

1)  When  FDl=--  =  FDm  =  0, 


Zm{u\  T,  0,  0,  '  •  •  ,0)| 
a(u)(T-\r\)} 


+  |n(u,r)|  (12) 


which  means  if  there  is  no  doppler  mismatch,  there 
will  be  no  interference  from  other  waveform. 

2)  If  r  =  0,  then  (11)  becomes 


I Zm(u-,  0,  Fd i,  •  •  ■  ,  FDa 


m 


+ 


y  |a(u)sinc[7r(i  -  m  +  F^jT)]! 
a(u)  sin[7rFomT] 


TnFD„ 


+  l«(w)l 


(13) 


3)  If  t  =  0,  and  FDl  =  ■  ■  ■  =  FDm  =  0,  then  (11) 
becomes 


|Zm(u; 0,0,0, •••  ,0)|  «  |q(u)|  +  |n(u)|  (14) 

Doppler  mismatch  happens  quite  often  in  target 
search  where  target  velocity  is  not  yet  known.  How¬ 
ever,  in  target  recognition,  generally  high-resolution 
measurements  of  targets  in  range  (r  =  0)  and 
doppler  are  available,  so  (14)  will  be  used  for  RCS 
value  estimation. 

How  to  combine  all  the  Zm’s  (to  =  1,2,---  ,M)  are 
very  similar  to  the  diversity  combining  in  communations 


to  combat  channel  fading,  and  the  combination  schemes 
may  be  different  for  different  applications.  In  this  paper, 
we  are  interested  in  applying  RSN  waveform  diversity  to 
estimate  the  RCS  parameter,  72.  In  this  paper,  we  pro¬ 
pose  a  maximum  likelihood  algorithm  for  RCS  parameter 
estimation. 


C.  Maximum  Likelihood  Algorithm  for  RCS  Parameter 
Estimation 

For  Swerling  2  model,  the  RCS  voltage  |a(u)|  follows 
Rayleigh  distribution  and  the  I  and  Q  subchannels  of  a(u) 
follow  zero-mean  Gaussian  distributions  with  variance  7 2 
(the  RCS  average  power  value).  Assume 

a(u)  =  a/(u) +jaQ(u)  (15) 

and  n(u )  =  n/(u)  +  jriQ{u)  follows  zero-mean  complex 
Gaussian  distribution  with  variance  c2  for  the  I  and  Q 
subchannels. 

According  to  (8),  (11),  and  (14), 

\Zm(u\ 0, 0, 0,  ■  •  •  ,0)|  «  \a(u)+n{u)\  (16) 

is  a  more  accuate  approximation.  Since  a(u)  and  n(u) 
are  zero-mean  complex  Gaussian  random  variables,  so 
a(u)  +n(u)  is  a  zero-mean  Gaussian  random  variable  with 
variance  72  +  tr2  for  the  I  and  Q  subchannels,  which  means 
Dm  =  \Zm{u;  0,0,  •  ■  ■  ,0)1  follows  Rayleigh  distribution 
with  parameter  y/-/2  +  a2, 

The  mean  value  of  ym  is  ^/^(F+^I,  an(j  variance  is 

(4~ 72.(7  +a  I.  jhe  variance  of  signal  is  .  and  the 

variance  of  noise  is  > — ^ — . 

Let  y  =  [3/1,  i/2,  ••  •  ,  2/m],  then  the  pdf  of  y  is 

M 

/( y)  =  n  (18) 

rn—\ 

so  the  ML  algorithm  to  estimate  the  RCS  average  value 
(72),  let 

0  =  7 2  (19) 


then  (18)  can  be  expressed  as, 
<?ml( y)  =  arg  sup  /( y) 
M 

=  arg  sup  TT 


(20) 


Vm 


exp[- 


+  2 {e  +  o2Y 

Maximizing  /( y)  is  equivalently  to  maximizing 
log /(y)  (natural  logarithm), 


M 


log/(y)  =  y^og( 


m=l 


6  +  o'- 


-) 


2(0  +  ct2)j 


(21) 
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since  it  is  a  continuous  function  for  ym  >  0  and  0  >  0,  so 
a  necessary  condition  for  the  ML  estimation  is 


TM  ,.2 

_  Z-/m=l 


^iog/(y)lfl=eML(y)  = 

which  has  the  unique  solution 


2M{9  +  a2) 


2(9  + a2)2 


\r-^M  2 

0ml  ( y)  = 


Considering  9  >  0, 

0ml  (  y)  =  max 


2M 


r^M  ' 

2-,m  —  1  0»n  _2 


2M 


<7^,0 


=  0 
(22) 

(23) 

(24) 


Since 

d2  4M3 

^  log/(y)le=eMt(y)  =  ~7^aT  ~2,2  <  0  (25) 

\Z-/m= 1  Mm) 

this  solution  gives  the  unique  maximum  of  log/ (y).  The 
expectation  of  §ML  (y)  is 

/■oo  2 

£e[<Wy)]  =  yo  — 2 ^  m-f(ym)dym  -  rr2 

=  rZLlVl  Vm  .  i& 
y0  2M  0  +  rr2  11  2(0  +  rr2 
=  0 


so  it’s  an  unbiased  estimator. 

Fisher’s  information  for  this  case  can  be  computed  via 


h  =  log/(y)l  =  -Eg 


M(9  +  a2)-E^y\ 


(0  +  a2)3 


(28) 


The  mean  value  of  ym  is  y^4r~^>  anc'  variance  is 

_ ^  )  c  r\  ( 

2 


(4  7r)(e+g-  ) ,  SQ  jjjg  Cramer-Rao  lower  bound  (CRLB)  is 


Vare[9( y)]  > 


h 


(0  +  cr 


2\2 


M 


(29) 


Since  ^log/(y)  in  (22)  is  of  the  form  k(e){9ML(y)  - 
£<>[%)]  for 


m 


AM(9  +  a2)2 


(30) 


we  conclude  that  0ml  (y)  can  achieve  the  CRLB  theo¬ 
retically  [13].  From  29,  it’s  clear  that  CRLB  is  inverse 
proportionally  to  the  number  of  radars  M  in  RSN,  which 
means  RSN  with  larger  M  will  have  much  lower  CRLB. 


D.  Simulations 

For  fluctuating  target  with  RCS  parameter  0  =  2 
(Rayleigh  distribution),  we  ran  Monte  Carlo  simulations 
for  106  realizations  at  each  SNR  value,  and  we  applied 
the  ML  estimation  algorithm  to  estimate  the  parameter  0 


for  each  realization.  In  Fig.  5,  we  plotted  the  variance  of 
the  RCS  ML  estimator  with  different  number  of  radars  in 
RSN.  Observe  that 

1)  the  actual  variance  of  0  matches  exactly  with  the 
CRLB  for  different  number  of  radars  in  RSN,  which 
validates  our  theoretical  results:  our  ML  estimator 
on  RCS  parameter  is  an  unbiased  estimator  and  the 
variance  of  parameter  estimation  matches  CRLB. 

2)  the  actual  variance  of  0  reduces  as  M  increases,  and 
numerically  it  is  reverse  proportional  to  M  as  we 
shown  in  Section  IV. 


Fig.  5.  Variance  of  the  RCS  ML  estimator  with  different  number  of 
radars  in  RSN. 


V.  Conclusions  and  Future  Works 

We  have  proposed  a  Knowledge-based  Ubiquitous  and 
Persistent  Sensor  networks  (KUPS)  for  threat  assessment, 
of  which  “sensor”  is  a  broad  characterization  concept, 
and  it  can  be  organic  sensors,  HUMINT  sensors,  SIGINT 
sensors,  etc.  Our  KUPS  for  threat  assessment  consists 
of  two  major  steps:  threat  detection  using  fuzzy  logic 
systems  and  threat  parameter  estimation  using  radar  sensor 
networks.  Our  fuzzy  logic  systems  can  combine  the  lin¬ 
guistic  knowledge  from  different  intelligent  sensors  which 
contains  lots  of  uncertainties.  We  propose  a  ML  estimation 
algorithm  for  target  RCS  parameter  estimation.  Theoreti¬ 
cally  we  show  that  our  ML  estimator  is  unbiased  and  the 
variance  of  parameter  estimation  matches  the  Cramer-Rao 
lower  bound.  Simulations  further  validate  these  theoretical 
results. 

The  proposed  techniques  will  increase  the  sensitivity 
and  performance  of  existing  and  future  NCW,  enhancing 
ship  self  defense  modes  against  stealthy,  sea  skimming, 
anti-ship  cruise  missiles.  In  future  works,  we  will  also 
infer  intent  of  objects/entities,  or  groups  of  objects,  in 
the  regions  of  interest.  We  will  also  study  methods  for 
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constructing  and  learning  a  wide  variety  of  models  of 
threat  behavior;  methods  for  reasoning  with  uncertain  and 
incomplete  information  for  assessing  threats  from  object 
activities. 

Acknowledgement 

This  work  was  supported  by  the  U.S.  Office  of  Naval 
Research  (ONR)  Young  Investigator  Program  Award  under 
Grant  N000 14-03- 1-0466. 

The  author  would  like  to  thank  ONR  Program  Officer 
Dr.  Rabinder  N.  Madan  for  his  direction  and  insightful 
discussion  on  radar  sensor  networks. 

References 

[1]  Y.  Cai,  et  al,  “A  Situation  and  Threat  Assessment  Model  Based  on 
Group  Analysis,”  Proceedings  of  2005  International  Conference  on 
Machine  Learning  and  Cybernetics,  Aug.  2005,  pp.  356  -  361. 

[2]  J.  W.  Choi;  J.  W.  Joo;  D.  L.  Cho,  “Situation/Threat  Assessment 
Fusion  System  (STAFS),”  Proceedings  of  the  Fifth  International 
Conference  on  Information  Fusion,  July  2002,  pp.  1374  -  1380. 

[3]  Q.  Changwen,  H.  You,  “A  method  of  threat  assessment  using  multiple 
attribute  decision  making,”  6th  International  Conference  on  Signal 
Processing,  Aug.  2002,  pp.  1091  -  1095. 

[4]  I.  W.  Dali,  “Threat  assessment  without  situation  assessment,”  Proc 
of  Information,  Decision  and  Control,  Feb.  1999,  pp.  365  -  370. 

[5]  P.  Gonsalves,  el  al,  “Intelligent  threat  assessment  processor  (ITAP) 
using  genetic  algorithms  and  fuzzy  logic,”  Proceedings  of  the  Third 
International  Conference  on  Information  Fusion,  2000. 

[6]  T.-C.  Hou  and  T-J.  Tsai,  “  An  access-based  clustering  protocol 
for  multihop  wireless  ad  hoc  networks,”  IEEE  J.  Selected  Areas 
in  Communications,  vol.  19,  no.  7,  pp.  1201-1210,  July  2001. 

[7]  A.  Iwata,  C.  C.  Chiang,  G.  Pei,  M.  Gerla,  and  T.  W.  Chen,  “  Scalable 
routing  strategies  for  ad  hoc  networks,”  IEEE  J.  Selected  Areas  in 
Communications,  vol.  17,  pp.  1369-1379,  1999. 

[8]  T.  Jan,  “Neural  network  based  threat  assessment  for  automated 
visual  surveillance,”  IEEE  International  Joint  Conference  on  Neural 
Networks,  vol.  2,  July  2004,  pp.  1309  -  1312. 

[9]  Q.  Liang,  “Waveform  Design  and  Diversity  in  Radar  Sensor  Net¬ 
works:  Theoretical  Analysis  and  Application  to  Automatic  Target 
Recognition,”  accepted  by  Int  7  Workshop  on  Wireless  Ad  Hoc  and 
Sensor  Networks,  June  2006,  New  York. 

[10]  Q.  Liang,  “Radar  Sensor  Networks:  Algorithms  for  Waveform 
Design  and  Diversity  with  Application  to  ATR  with  Delay-Doppler 
Uncertainty,”  submitted  to  EURASIP  J.  on  Wireless  Communications 
and  Networking. 

[1 1]  C.  R.  Lin  and  M.  Gerla,  “Adaptive  clustering  in  mobile  wireless 
networks,”  IEEE  J.  Selected  Areas  in  Communications,  vol.  16,  pp. 
1265-1275,  1997. 

[12]  J.  M.  Mendel,  “Fuzzy  Logic  Systems  for  Engineering  :  A  Tutorial,” 
Proceedings  of  the  IEEE,  vol.  83,  no.  3,  pp.  345-377,  March  1995. 

[13]  J.  M.  Mendel,  Lessons  in  Estimation  Theory  for  Signal  Processing, 
Communications,  and  Control,  Prentice-Hall,  Upper  Saddle  River, 
NJ,  1995. 

[14]  J.  M.  Mendel,  Uncertain  Rule-Based  Fuzzy  Logic  Systems,  Prentice- 
Hall,  Upper  Saddle  River,  NJ,  2001. 

[15]  J.  E.  Manley,  G.  M.  Mineart,  A.  E.  Sheridan,  “Underwater  ma¬ 
rine  domain  awareness  for  first  responders:  a  low  cost  approach,” 
MTS/IEEE  TECHNO-OCEAN  ' 04  vol.  3,  2004,  pp.  1695  -  1700. 

[16]  X.  T.  Nguyen,  “Threat  assessment  in  tactical  airborne  environ¬ 
ments,”  Proceedings  of  the  Fifth  International  Conference  on  Infor¬ 
mation  Fusion,  July  2002. 

[17]  N.  Okello,  G.  Thoms,  “Threat  assessment  using  bayesian  networks,” 
Proceedings  of  the  Sixth  International  Conference  of  Information 
Fusion,  vol.  2,  pp.  1102  -  1109,  2003. 


[18]  C.  E.  Perkins,  “Chapter  4,  Cluster-Based  Networks,”  Ad  Hoc 
Networking,  Edited  by  C.  E.  Perkins,  pp.  75-138,  Addison- Wesley, 
2001. 

[19]  J.  Roman,  M.  Rangaswamy,  D.  Davis,  Q.  Zhang,  B.  Himed,  and 
J.  Michels,  “Parametric  adaptive  matched  filter  for  airborne  radar 
applications,”  IEEE  Trans.  Aerosp.  Electron.  Syst.,  vol.  36,  no.  2,  pp. 
677-692,  2000. 

[20]  M.  A.  Richards,  Fundamentals  of  Radar  Signal  Processing, 
McGraw-Hill  Companies,  New  York,  2005. 

[21]  M.  I.  Skolnik,  Introduction  to  Radar  Systems,  3rd  ed,  New  York, 
McGraw  Hill,  2001. 

[22]  P.  Swerling,  “Probability  of  detection  for  fluctuating  targets”,  IRE 
Trans  on  Information  Theory,  vol.  6,  pp.  269-308,  April  1960. 

[23]  G.  Petterson,  el  al,  “Multi-source  integration  and  temporal  situation 
assessment  in  air  combat,”  1999  Information,  Decision  and  Control, 
Feb.  1999,  pp.  371  -  375. 

[24]  A.  N.  Steinberg,  “An  approach  to  threat  assessment,”  2005  8th 
International  Conference  on  Information  Fusion,  vol.  2,  July  2005. 


8 


Orthogonal  Waveform  Design  and  Performance 
Analysis  in  Radar  Sensor  Networks  (RSN) 

Jing  Liang  and  Qilian  Liang 
Department  of  Electrical  Engineering 
University  of  Texas  at  Arlington 
416  Yates  Street 
Nedderman  Hall,  Rm  518 
Arlington,  TX  76019 

Email:  jliang@wcn.uta.edu,  liang@uta.edu 


Abstract —  In  radar  sensor  networks  (RSN),  radar  sensors  are 
likely  to  interfere  with  each  other  if  their  waveforms  are  not 
properly  designed.  We  propose  orthogonal  waveforms  for  RSN, 
which  eliminate  interference  when  no  doppler  shift  is  introduce. 
Additionally,  this  approach  applies  the  advantage  of  spacial  diver¬ 
sity  through  equal  gain  combination  performed  by  clusterhead. 
When  doppler  shift  is  considered  and  interference  is  unavoidable, 
we  analyzed  the  performance  of  this  design  not  only  in  coherent 
RSN,  but  in  noncoherent  systems  as  well.  The  latter  scenario 
is  more  challenging  as  doppler-shift  uncertainty  results  in  more 
complicated  implementation.  Monte  Carlo  simulation  shows  that 
our  technique  provides  much  better  detection  performance  than 
single  radar  for  fluctuating  targets,  in  terms  of  probability  of 
false  alarm  and  miss  detection.  Conclusions  arc  drawn  based  on 
our  analysis  and  further  related  research  areas  are  discussed. 

I.  Introduction  and  Motivation 

It  is  known  to  all  that  slow  fluctuations  of  target  radar 
cross  section  (RCS)  result  in  radar  target  fades,  which  is  a 
main  factor  in  performance  degradation  [8],  Faced  with  the 
challenge  from  weak  RCS  targets,  such  as  cruise  missiles  and 
stealth  targets,  modem  radar  sensors  demand  higher  capability 
of  accurate  target  detection  and  range  estimation.  In  order 
to  satisfy  this  requirement,  much  attention  has  been  paid  to 
waveform  design. 

Among  the  existing  works,  Bell  [1]  applied  information 
theory  to  design  radar  waveforms.  He  demonstrated  that  when 
the  transmitted  radar  waveform  is  scattered  by  the  target, 
larger  SIR  is  achieved,  so  in  order  to  better  detect  target, 
distributing  energy  may  be  a  perfect  choice.  Sowelam  and 
Tewfik  [2]  studied  signal  selection  procedure  for  sequential 
radar  target  classification.  In  their  design,  the  criterion  to 
chose  signal  is  whether  it  maximizes  the  Kullback-Leiber  in¬ 
formation  numbers.  Their  research  focused  on  two-class  signal 
selection  and  Gaussian  unequal  mean  target  models.  In  [3] 
Sun  et  al.  applied  several  fusion  schemes  to  study  constant 
frequency  (CF)  and  linear  frequency  modulated  (LFM)  wave¬ 
forms,  which  improved  detection  probability  and  estimation 
accuracy.  However,  all  the  above  research  involves  only  single 
radar. 

In  nature,  a  network  of  multiple  radar  sensors  can  been 
introduced  to  combat  performance  degradation  of  single  radar 
along  with  waveform  optimization.  These  radar  sensors  are 


managed  by  an  intelligent  clusterhead  that  combines  waveform 
diversity  in  order  to  satisfy  the  common  goals  of  the  network 
other  than  each  radar  operate  substantively. 

Apart  from  better  performance,  RSN  are  capable  to  solve 
blind  speed  problem.  Radar  blind  speed  occurs  when  the 
doppler  shift  is  equal  to  the  same  or  a  multiple  of  the  pulse 
repetition  frequency  (PRF).  Under  these  circumstances,  target 
return  is  suppressed  so  that  a  zero  signal  is  obtained  [7],  As 
for  RSN,  if  PRF  of  each  member  is  properly  designed,  for 
instance,  co-prime  to  each  other,  the  probability  of  blind  speed 
occurrence  will  be  tremendously  reduced. 

Although  the  idea  of  multistatic  radar  which  employs  mul¬ 
tiple  transmitters  or  receivers  to  sample  the  target  static  scat¬ 
tering  behavior  is  not  new,  spatial  diversity  is  still  neonatal  to 
radar  research.  Fishier  et  al.  presented  statistical  MIMO  radar 
system  in  [5].  Their  system  applied  target  spatial  diversity, 
thus  obtained  approximately  constant  average  received  energy 
and  superior  detection  performance  to  that  achieved  through 
coherent  processing.  However,  effects  of  doppler  shift  and 
clutter  were  not  considered.  In  [4],  Liang  performed  theoretical 
studies  on  constant  frequency  (CF)  pulse  waveform  design  and 
proposed  maximum-likehood  (ML)  automatic  target  recogni¬ 
tion  (ATR)  approach  for  both  nonfluctuaing  and  fluctuating 
targets.  Nevertheless  these  studies  also  assumed  no  delay- 
doppler  uncertainty. 

This  paper  leans  heavily  on  prior  research  on  radar  wave¬ 
form  design  and  spacial  diversity.  What  distinguishes  it  is  its 
detailed  performance  analysis  for  both  coherent  and  nonco¬ 
herent  RSN  when  doppler  shift  is  considered.  The  rest  of 
this  paper  is  organized  as  follows.  Section  II  describes  our 
propagation  models.  Section  III  and  IV  analyzes  coherent  and 
noncoherent  detection  respectively.  Simulations  are  given  in 
Section  V  and  Section  VI  concludes  the  paper. 


II.  Waveform  Model  and  Problem  Formulation 

We  assume  our  RSN  consists  of  N  radars  networked 
together  in  a  self-organizing  fashion.  Their  propagation  and 
target  model  is  shown  in  Fig.l.  The  ith  radar  transmits  a 


radar  i 


where  c  is  the  speed  of  light,  and  <j>  follows  uniform  distribu¬ 
tion  within  [0,27t],  As  fc  »  A j. 


Fig.  1.  Propagation  and  target  model  for  RSN 


waveform  typically  modeled  as 


Si{t)  =  Asi{t)  ■ 


I—  cos[2tt(/c  +  Aj)f] 

Jp 


(1) 


where  tilde  on  Si  denotes  that  the  signal  has  been 
modulated.  ASi(t)  is  the  constant  amplitude  pulse  enve¬ 
lope  [7],  is  a  normalization  factor  to  ensure  that 

JqP  | ■  cos[27r(/c  +  Aj)f] |  dt—  1.  Here  Tp  is  the  time 
duration  for  radar  pulses  and  each  oscillator  of  radar  works  at 
a  different  frequency:  =  fc  +  A,,  fc  »  A;,  where  fc  is  the 
system  carrier  frequency  and  A,  must  satisfies  the  following 
equation: 

Afc  —  A i  —  {k  —  i)  ■  —  (2) 

1p 


Here  k  and  i  are  unequal  integers  to  provide  orthogonality 
waveform  for  each  radar  member  [9]. 

Consider  a  point  moving  target  at  an  instant  range  R{. 
ti  second  after  transmitting  the  pulse,  the  ith  radar  receives 
waveforms  of  useful  back-scattered  radiation  from  the  target 
with  inteference  from  other  radar  sensors  as  well  as  clutter 
and  noise.  The  useful  signal  can  be  modeled  as 


Sri{t)  =  Ai(t)  •  y  Y  cos[2?r(/c  +  Ai  +  fdi)(t  -  U)}  (3) 

where  Ai(t)  represents  amplitude  of  the  returned  useful  signal. 
As  a  fluctuating  target  is  more  realistic  than  nonfluctuating 
object,  it  is  more  reasonable  to  apply  “Swerling  II”  model  in 
our  situation  and  thus  Ai(t)  follows  Rayleigh  distribution  [8], 
The  target  mobility  introduces  doppler  shift  in  returned  signal, 
which  is  denoted  by  /*.  Assume  the  target  is  moving  at  the 
speed  v,  than  fdi  is 


f  _v{fc  +  Ai) 

Jdi  —  ’  COS(p 


(4) 


/<«  =  —  •  costf)  (5) 

c 

In  addition,  the  estimated  instant  range  to  the  target  is  Ri  — 
c  ■  ^  [6], 

As  all  of  the  radar  sensors  are  transmitting  signals,  the  ith 
radar  not  only  receives  its  own  back-scattered  waveform,  but 
also  scattered  signals  generated  by  other  radars,  and  therefore 
under  the  condition  of  timing  synchronization,  interference 
waveforms  received  by  the  ith  radar  can  be  modeled  as 

h{t)  =  ^*(0  •  t/J- cos[2tt(/c  +  Afc  + /dfc)(f  -  fi)] 

fc=i,fc^i  V  lp 

(6) 

where  23*.  (t)  is  the  amplitude  of  interference  from  the  kth 
radar,  which  also  follows  Rayleigh  distribution,  and  fd).  is  the 
doppler  shift  for  kth  radar.  This  is  illustrated  in  Fig.l.  As 
the  interference  from  other  radars  is  the  dominant  interference 
that  results  in  target  fade,  we  may  ignore  clutter  under  these 
circumstances.  Apart  from  interference,  the  ith  radar  also 
receives  additive  white  Gaussian  noise  (AWGN)  rt;(£)  with 
mean  value  zero  and  variance  N0/2.  Therefore,  the  combined 
received  waveform  for  the  ith  radar  is 

Ri{t)  =  Sri(t)  +  Ii{t)  +  ni{t)  (7) 

After  introducing  our  propagation  and  target  model,  further 
analysis  on  coherent  and  noncoherent  RSN  are  carried  out 
respectively. 


III.  Coherent  Detection 

In  coherent  RSN,  radar  members  are  smart  enough  to  extract 
the  knowledge  of  the  exact  doppler  shift  introduced  by  moving 
targets,  so  based  on  the  a-priori  information,  the  demodulator 
of  each  radar  can  be  constructed  as  shown  in  Fig. 2.  According 


Fig.  2.  Coherent  RSN  demodulation  and  waveform  combining 
to  this  structure,  the  combined  received  waveform  R,(t)  is 


processed  by  its  corresponding  matched  filter.  The  output  of 
the  ith  branch  Fj(f)is 


m  -  L 


Ri{t)  ■ .  —  cos[27r(/c  +  +  fdi)(t  -  U)]dt 


It  can  also  be  represented  as 

Yi(t)  =  Si(t)  +  Ii(t)+ni(t)  (9) 

where  Si{t),  h{t),  n» (t)  denotes  the  output  of  useful  signal, 
interference  and  noise  respectively. 


rti+Tr,  I  9 

3i(t)  =  Jt  SriW’JjT 


cos[27 r(/c  +  A;  +  fdi){t  -  ti)]dt 


It  can  be  easily  derived  that 

Si{t)  =  Ai(t) 
Similarly,  /j(f)  is  given  by 


fU+Tv  _  rr- 


cos[27r(/c  +  A i  +  fdi)(t  -  ti)]dt 


Simplifies  the  above  equation,  we  can  obtain  that 


m  =  E 


Bk(t)sin[2ir(fdk  -  fdi)Tp\ 
2ir[(k-i)  +  (, fdk  ~  fdi)TP] 


As  for  noise,  it  can  be  easily  proved  that  nj(f)  is  still  an 
AWGN  with  mean  value  zero  and  variance  N0/2.  Therefore 
the  output  of  the  ith  radar  is 


(14) 

Assume  each  radar  works  at  the  same  level  of  detection 
accuracy,  the  RSN  clusterhead  can  apply  equal  gain  combining 
algorithm  before  making  a  final  decision. 

Since  Swerling  II  models  is  applied,  A,(f)  follows  Rayleigh 
distribution,  which  can  be  modeled  as 

Mt)  =  A!(t)+jAf(t)  (15) 

In  the  same  way,  noise  is  given  by 

ni{t)  =  nIi(t)+jnf(t)  (16) 

both  I  and  Q  subchannels  of  A;(t)  and  nt  (t)  follow  zero- 
mean  Gaussian  distribution  with  corresponding  variance  7? 


in  above 


and  a2  [4],  Assume  each  | \  in  above 
equation  follows  Rayleigh  distribution  with  variance  /I2  for  its 
I  and  Q  subchannels,  then  |ki(f)|  follows  Rayleigh  distribution 
with  parameter 

ai  =  \!  7 i  +  Pi  +  (17) 


E  0* 

k=l,k^i 


so  when  there  is  a  moving  target,  the  pdf  for  \Yi(t)\  is 

Mw)  =  5ea;P(-&)  (19^ 

ai  Loti 

The  mean  value  of  y,  is  at  y7| ,  and  variance  is  (2  -  \ )af. 

The  variance  of  useful  signal  is  (2  —  f)72i  and  the  vaiance 

2 

of  noise  is  (2  -  | )af.  Therefore,  SNR  is  0  and  it  is  worth 

2l 

mentioning  that  SIR  for  coherent  RSN  is  . 

After  equal  gain  diversity  combination,  tHe  synthesized  pdf 
becomes 


In  case  of  no  target,  i.e.  there  exits  only  noise  ,  and  hence  the 
pdf  of  \Yi{t)\  is  given  by 

fn{Vi )  =  2)  (21> 

and  accordingly  the  synthesized  pdf  becomes 

/„(y)  =  n^ip(-i)  (22) 

1  *  * 

Based  on  the  knowledge  of  pdf,  we  apply  Bayesian’s  rule  to 
obtain  our  decision  criterion  , which  is 

target  exists 

fs( y)  <  ,  Pn  , 

— r—  no  target  —  (23) 

Jn{ y)  *8 

where  Pn  denotes  the  probability  of  no  target  but  noise  and 
Ps  represents  the  probability  of  target  occurrence. 


IV.  Noncoherent  Detection 

As  far  as  noncoherent  RSN  is  concerned,  the  difference 
from  the  above  system  is  that  radars  have  no  knowledge  of 
the  accurate  doppler  shift  introduced  by  moving  targets,  so 
each  matched  filter  applies  the  same  frequency  as  transmitted 
carrier.  Although  this  situation  is  more  complicated  to  handle, 
as  for  modem  military  it  is  more  practical.  Our  construction 
of  RSN  demodulators  is  shown  in  Fig. 3. 

In  terms  of  this  structure,  the  received  signals  of  the  ith 
radar  is  first  multiplied  by  separate  cosine  and  sine  waveforms 
generated  by  the  local  oscillator.  The  receiver  then  sums  of  the 
sine  and  cosine  correlations,  extracts  its  envelope,  and  then 
transmits  the  result  to  RSN  cluterhead,  which  will  make  final 
decision  based  on  the  combined  information  colletcted  by  each 
radar  member.  It  is  obvious  that  because  of  not  knowing  the 
doppler  shift,  this  system  involves  nonlinear  operations  unlike 
the  coherent  RSN  in  the  above  section. 

As  for  the  ith  radar,  the  output  of  inphase  branch  is 


7<‘>=r 


Ri{t) ■  < /—  cos[2?r(/c  +  Ai)(t-ti)\dt  (24) 


Similar  to  equation(9),  it  can  be  also  represented  as 
Y/{t)  =  S*  (t)  +  I Ht)  +  n{  ( t ) 


(25) 


(35) 


■TP  ,,  //(f).  ,2 

'o  {)dt— ^  h|/,wi 
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— jpoa^  fj 
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tf(o  =  E 
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Fig.  3.  Noncoherent  RSN  demodulation  and  waveform  combining 


S((t)  =  Aj(t)  •  sinc{2ir f diTv)  (26) 

N 

l((t)  =  E  Bk(t)sinc  [27r(Afc  -  Aj  +  fdk)Tp)\  (27) 

/c=l , 

And  n{  ( t )  is  the  noise  in  inphase  branch.  In  the  same  way, 
the  output  of  quadrature  branch  is 


Based  on  the  above  equations  and  the  construction  in  Fig. 3, 
it  can  derived  that 


, _ MA  - _ 

/ A2(t)sin29j  |  2Ai(t)Bir(t)sin9iSin6kCos(Bi—9k)  . 

y _ 0? _ **  2^fc=l,fc^z  (7r(/c — ' 

EN  B2 sin2 9k  |  2 Bm  Bj  amflm sinflj  cos(flm —9j) 

k=l ,k^i  [7r(fc— i)-t-0jc]2  '  [7r(m— i)+0TO][7r(j— i)+0j-] 

+"?W 

(37) 

We  can  have  two  special  cases  as  follows: 

1)  If  there  is  no  doppler  shift,  i.e.  fdi  =  0  and  (lt  =  0,  then 
stn^i  _j  ancj  tp,us  equation(37)  becomes 

l^(<)l  =  \J  A\(t)  +  n?(f)  (38) 

2)  If  there  is  only  one  radar,  then  equation(37)  becomes 

\Yi(t)\  =  +  n2{t)  (39) 

Similar  to  coherent  system,  in  equation(37)  we  may  assume 


Ai{t)sin9j 


s~^N  2Ai(t)Bk(t)sin9iSin0kCos(6i  —  9k)  , 

'  /L/k=l,k&  f-Ti-Cfc — n 


PH-r-tp  /  o 

Y?{t)  =  Ri(t)-J  —  sin[27r(/c+A,)(f— fj)]df  (28) 


which  can  also  be  given  as 


yiQ(t)  =  5lQ(t)  +  /f(t)+nf(t) 


,q,.,  _  Aj(t)  [cos(27r/dirp)  -  1] 


s?(t)  = 


ZnfdiTp 


jQn\  _  y-'  Bk{t)  {cos[27r(Afc  —  A,  +  fdk)Tp]  —  1} 

1  ^  27r(A  k  —  Aj  +  fdk)Tp 


k—l,k^i 


and  nf(t)  is  the  noise  in  quadrature  branch. 

To  simplify  the  computation,  we  can  define 

0i  =  nfdiTp  (3 

so  equation(26)(27)(30)(31)  are  also  respectively  given  by 

o/y.x  _  Ai(t)sin6icos0i 

ai  V1)  —  - 2 -  IJ 


riu\  Bk{t)sin8kcos6k 


Miat  |  6i  \<y^k=i,k& _ [TT(fe— o+gfcjgj  2. 

EN  Blsin20k  .  2BrnBisin6msinBjCos(8m—6j) 

fc=l,fcyti  [7r(fc_t)+fi,t]2  f"  [rr(m-i)+em][7rO-i)+0j] 

and  |n,(f)|  follow  Rayleigh  distribution  with  variance  7?, 
rjf  and  of  for  their  corresponding  I  and  Q  subchannels,  and 
therefore  | (i)  |  follows  Rayleigh  distribution  with  parameter 

ai  =  \fli+Vi+^i  (40) 

Similarly,  we  may  apply  the  equation(19)(20)  (21)(22)(23) 
in  coherent  RSN  to  analyze  the  dection  performance  and  in 
noncoherent  RSN,  SIR  is 

vi 

V.  Simulations 

In  order  to  study  the  performance  versus  the  number  of 
radars  in  RSN  under  the  condition  of  doppler  shift,  we  apply 
our  waveform  models  and  equal  gain  combination  technique 
in  our  simulation  with  following  parameters: 

1)  ^  =  5OOOH0,  i.e.  ( fdi)max  =  5000 Hz 
2)  Tp  =  1ms 

3)  Pn  =  P, 

4)  the  mean  value  and  variance  of  Bk{t )  are  equal  to  those 
of  Ai{t) 

5)  106  Monte-Carlo  simulations 

The  average  SNR  value  refers  to  the  average  SNR  of  all 
the  members  in  RSN.  Specifically  SNR  of  each  member  is 
the  power  of  received  useful  back-scattered  signal  to  that  of 
noise. 

The  performance  of  the  envelope  detector  in  single  radar, 
3-radar  and  6-radar  coherent  RSN  are  compared  in  Fig.4  and 


10 


Fig. 5,  in  terms  of  the  probability  of  false  alarm  and  probability 
of  miss  detection  respectively.  Fig.4  shows  that,  to  achieve  the 
same  Pfa  =  0.1,  single  radar  requires  about  7dB  greater  than 
the  SNR  of  6-radar  RSN.  In  latter  system,  even  SNR  is  ldB, 
Pfa  is  around  0.1,  which  is  good  enough  [6]. 

Observe  Fig. 5,  we  can  see  that  Pm  of  single  radar  is 
much  larger  than  0.1  even  SNR  reaches  lOdB.  When  Pm  = 
0.1,  SNR  of  3-radar  and  6-radar  are  around  8.3dB  and  4dB 
respectively.  These  two  figures  demonstrate  that  our  coherent 
RSN  could  provide  superior  detection  performance  to  that  of 
single  radar. 


Fig.  4.  Probability  of  false  alarm  for  coherent  RSN 


Fig.  5.  Probability  of  miss  detection  for  coherent  RSN 

In  noncoherent  RSN,  the  probability  of  false  alarm  and 
probability  of  miss  detection  are  shown  in  Fig. 6  and  Fig. 7. 
These  two  figures  clearly  illustrate  that  performance  of  non¬ 
coherent  single  radar  is  extremely  worse  than  that  of  coherent 
system  and  it  can  not  work  properly  while  SNR  is  below 


Fig.  6.  Probability  of  false  alarm  for  noncoherent  RSN 


Fig.  7.  Probability  of  miss  detection  for  noncoherent  RSN 


or  even  equal  to  lOdB.  Clearly,  even  3-radar  RSN  could  not 
provide  enough  performance  improvement.  Applying  6-radar 
RSN,  performance  has  been  improved  a  lot  compared  to  single 
radar.  Here  PFa  =0.1  is  achieved  when  SNR  is  greater  than 
3dB  and  to  satisfy  Pm  <  0.1,  SNR  must  be  increased  to 
around  5.3  dB. 

Meanwhile,  it  is  clear  to  see  no  matter  how  many  radars 
have  been  exploited  in  our  RSN,  the  performances  of  nonco¬ 
herent  RSN  are  worse  the  coherent  system,  i.e.,  noncoherent 
RSN  demand  higher  energy  in  order  to  achieve  the  same 
performance. 

From  the  above  figures,  it  is  obvious  that  although  doppler 
shift  generates  interference  between  each  radar  sensor,  promis¬ 
ing  detection  performance  has  been  obtained  applying  our 
orthogonal  waveform  and  combination  technique.  Particularly, 
this  design  provides  greater  performance  in  case  of  noncoher- 


ent  10-radar  RSN. 


VI.  Conclusion  and  Future  Works 

We  have  studied  orthogonal  waveforms  and  spacial  diversity 
under  the  condition  of  doppler  shift  in  radar  sensor  networks 
(RSN).  Both  cases  of  coherent  detection  and  noncoherent 
detection  have  been  analyzed.  In  case  of  no  doppler  shift, 
our  orthogonal  waveforms  eliminate  interference  between  each 
member.  However,  when  there  is  doppler  shift,  there  exists 
interference  that  can  not  be  avoid.  Simulation  results  show 
that  applying  equal  gain  combination  technique  through  clus- 
terhead  to  our  signal  models,  better  detection  performance  than 
single  radar  is  achieved  for  fluctuating  targets.  This  design 
not  only  satisfies  higher  demanding  criterion  for  detection 
accuracy  in  modern  military  and  security  affairs,  but  also  offers 
advantage  to  combat  the  blind  speed  problem. 

One  can  extend  the  above  procedure  in  several  directions: 

1)  In  this  paper  we  only  considered  constant  frequency 
(CF)  pulse  waveform  design.  Naturally,  we  would  like 
to  extend  our  results  to  other  forms,  such  as  linear 
frequency  modulation  (LFM)  and  binary  phase-coded 
pulse,  and  analyze  their  performances. 

2)  For  simplicity,  we  assumed  perfect  timing  here,  so 
the  performance  of  RSN  can  be  studied  when  timing 
ambiguity  is  considered. 

3)  In  our  model,  we  only  assume  there  is  one  moving  target 
and  thus  multiple  targets  co-existence  and  corresponding 
performance  can  be  further  investigated. 
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Abstract — In  this  paper,  the  spatial-temporal-frequency  di¬ 
versity  to  improve  the  detection  performance  of  Radar  Sensor 
Networks  (RSN)  in  the  presence  of  certain  types  of  interference 
(clutter,  jamming,  noise  and  interference  between  radar  sensors) 
is  studied.  In  order  to  reduce  the  interference  between  radar 
sensors  and  maximize  the  signal-to-interference-plus-noise  ratio 
(SINR),  we  propose  a  method  using  the  orthogonality  criterion 
to  design  waveforms  for  radar  sensors  in  the  network.  Besides 
the  interference  between  radar  sensors,  performance  of  the 
network  depends  largely  on  other  interference,  especially  clutter, 
which  is  extended  in  both  angle  and  range,  and  is  spread  in 
Doppler  frequency.  By  using  the  spatial-temporal  diversity,  we 
can  suppress  effects  of  these  interference.  In  this  paper,  we  also 
propose  a  receiver  for  diversity  combining  in  RSN.  As  an  applica¬ 
tion  example,  we  apply  the  spatial-temporal-frequency  diversity 
scheme  to  improve  the  detection  performance  or  reduce  the  miss- 
detection  probability  at  a  low  false  alarm  probability.  Simulation 
results  for  both  non-fluctuating  targets  and  fluctuating  targets 
show  that  the  performance  of  our  proposed  scheme  is  superior 
to  that  of  the  single  radar  with  the  spatial-temporal  diversity 
only. 

I.  INTRODUCTION 

Radar  sensor  network  (RSN)  consists  of  collaboratively 
operating  radar  sensors  which  are  deployed  ubiquitously  (pos¬ 
sibly  randomly  placed)  on  airborne,  surface  and  sub-surface 
unmanned  vehicles.  Each  sensor  in  the  network  has  capabil¬ 
ities  for  radar  sensing,  signal  processing  and  wireless  com¬ 
munication.  Autonomous  radar  sensors  operating  in  the  mi¬ 
crowave  spectrum  are  used  to  detect,  classify  and  track  visible, 
obscured  or  hidden  targets  such  as  tactical  weapons,  aircraft, 
ships,  spacecraft,  vehicles,  people,  and  the  natural  environment 
in  the  presence  of  both  noise  and  interference  (clutter,  jamming 
and  interference  between  sensors).  Information  about  a  target 
is  wirelessly  forwarded  to  the  central  processor,  where  target 
identification  and  network-wide  tracking  are  conducted  using 
sensor  data  from  every  sensor  in  the  network,  along  with  their 
position  and  timing  information. 

In  the  radar  sensor  network,  radar  sensors  are  networked 
together  in  an  ad-hoc  fashion,  i.e.,  they  do  not  depend  on  any 
preexisting  infrastructure.  They  are  self-organizing  entities  that 
are  deployed  on  demand  in  support  of  various  events  such 
as  surveillance,  battlefield,  disaster  relief,  search  and  rescue, 
etc.  RSN  has  advantages  compared  to  a  single  radar  system 
in  improving  the  system  sensitivity,  reducing  obscuration 
effects  and  vulnerability  as  well  as  increasing  the  detection 


performance.  However,  when  deploying  the  RSN,  we  have  to 
solve  some  challenging  problems  such  as  networking  between 
radar  sensors,  canceling  effects  of  interference,  power  efficient 
communication,  and  reducing  complexity  of  signal  processing 
schemes,  etc.  Only  few  work  doing  research  on  these  aspects 
has  been  developed.  Recently,  S.  Kadambe  [1]  proposed  a 
minimax  entropy-based  technique  to  reduce  the  processing 
complexity  in  the  RSN.  In  [2],  relative  merits  of  the  RSN  and 
the  balance  between  increased  performance,  complexity,  and 
cost  were  discussed.  In  this  paper,  we  will  propose  a  method 
to  design  the  waveform  in  order  to  cancel  the  interference  be¬ 
tween  radar  sensors  and  maximize  the  signal-to-interference- 
plus-noise  ratio  (SINR).  In  research  literature  on  the  waveform 
design,  Fitzgerald  [3]  demonstrated  the  inappropriateness  of 
waveform  selection  based  on  measurement  quality  alone:  the 
interaction  between  the  measurement  and  the  track  can  be 
indirect,  but  must  be  taken  into  account.  Bell  [4]  used  the 
information  theory  to  design  waveform  for  the  measurement 
of  extended  radar  targets  exhibiting  resonance  phenomena. 
Baum  [5]  used  the  singularity  expansion  method  to  design 
some  discriminant  waveforms.  However,  these  design  methods 
were  used  for  the  single  radar  only.  In  [18],  radar  sensor 
networks  for  automatic  target  recognition  was  studied,  but 
clutter  and  jammer  were  not  considered. 

Performance  of  the  RSN  depends  largely  on  the  interference 
which  is  extended  in  both  angle  and  range,  and  is  spread  in 
Doppler  frequency  because  of  motion  of  the  platform  and 
target.  Space-Time  Adaptive  Processing  (STAP)  or  spatial- 
temporal  diversity  has  become  an  excellent  technique  to  sup¬ 
press  effects  of  interference.  STAP  refers  to  the  simultaneous 
processing  of  the  spatial  samples  from  an  array  antenna  and  the 
temporal  samples  provided  by  the  echoes  from  multiple  pulses 
of  a  coherent  processing  interval  (CPI).  A  considerable  amount 
of  work  has  been  done  to  develop  STAP  for  processing  data 
from  airborne  or  space-borne  radars  to  reliably  detect  moving 
targets  of  interest  in  the  presence  of  strong  clutter  returns 
and  jamming  [6]  -  [9].  By  combining  waveform  design  and 
spatial-temporal  diversity,  we  can  perform  spatial-temporal- 
frequency  diversity  in  RSN.  Studies  in  this  paper  will  show 
that  using  our  proposed  diversity  scheme  will  improve  the 
detection  performance  with  a  low  false  alarm  probability. 

The  remainder  of  this  paper  is  organized  as  follows.  In 
section  II,  we  propose  a  method  to  design  waveforms.  Spatial- 
temporal  diversity  and  interference  analysis  are  discussed  in 


the  section  III.  In  section  IV,  we  propose  a  diversity  combining 
scheme  and  analyze  detection  performance  for  non-fluctuating 
targets  as  well  as  fluctuating  targets  using  our  proposed 
diversity  scheme.  Simulation  results  and  performance  analysis 
are  discussed  in  section  V,  and  in  section  VI,  we  conclude  the 
paper. 

II.  Waveform  design 

In  radar  sensor  networks,  radar  sensors  will  interfere  with 
one  another  and  SINR  will  be  very  low  if  waveforms  are  not 
properly  chosen.  In  order  to  have  waveforms  designed  properly 
and  coexisted  in  the  network,  we  propose  orthogonality  as  one 
criterion  to  design  waveforms. 

In  our  radar  sensor  networks,  we  choose  constant  frequency 
(CF)  pulse  waveform.  The  CF  waveform  can  be  defined  as 

x{t)  =  J^exp(j2Ttft)  0  <  t  <  T  (1) 

where  E  is  the  energy  of  the  waveform  and  T  is  the  waveform 
pulse  duration. 

We  know  that  the  waveforms  from  different  radar  sensors 
will  interfere  with  one  another.  We  choose  the  waveform  for 
radar  i  as 

xi{t)  =  \J^exp{j2ir{f  +  Aj)t)  0<f<T  (2) 

which  means  that  there  is  a  frequency  shift  A;  for  the  radar 
sensor  i.  In  order  to  minimize  the  interference  between  radar 
sensors,  we  will  find  a  set  of  frequency  shifts  {A;}^1  (M 
is  the  number  of  radar  sensors)  for  which  the  waveforms  are 
orthogonal.  Let  R(k,l)  denote  the  cross-correlation  between 
the  waveforms  xk{t)  and  xi(t). 

R(k,l)  =  f  Xk(t)x*{t)dt  (3) 

Jo 

=  £sinc[(Ajt  -  Ai)T]exp\jn{Ak  -  A;)T] 

If  7r(Afc  —  A i)T  =  iir,  the  waveforms  xk{t)  and  xi{t)  are 
orthogonal,  i.e., 


So,  we  can  choose  a  set  of  frequency  shifts  {A*}^1  as 
below  [18]: 

Ai  =  Afc  -  A,  =  1  i  =  0,  1,  ...,  M  -  1  (4) 

Based  on  (4),  we  can  confirm  that  the  waveforms  can  co-exist 
if  the  frequency  shift  is  i/T  between  two  waveforms,  i.e., 
orthogonality  among  waveforms  can  be  achieved  by  separating 
frequencies  of  waveforms  by  multiplying  an  integer  with  the 
inverse  of  the  waveform  pulse  duration.  So,  we  will  choose  the 
waveforms  by  this  method  to  get  radar  sensors  coexisted  in  the 
network.  Moreover,  by  using  this  waveform  design  method,  we 
can  perform  a  frequency  diversity  in  the  RSN. 


Antenna  1  Antenna  2  Antenna  N 


y  =  wx 


Fig.  1.  Space-time  beamformer  consisting  of  an  N-element  ULA  and  a 
coherent  processing  interval  (CPI)  comprising  K  pulses  with  a  fixed  PRI  [9] 

III.  Spatial  -  Temporal  diversity  and  interference 

ANALYSIS 

A.  Spatial  -  Temporal  diversity 

At  each  radar  sensor,  we  use  a  receiver  with  an  array 
antenna  as  shown  in  Fig.  1.  This  array  consists  of  an  N- 
element  ULA  with  inter-element  spacing  di  (spatial  degrees 
of  freedom)  and  K  pulse  repetition  interval  (PRI)  time  taps 
(temporal  degrees  of  freedom).  Now,  we  consider  a  signal 
Si(f)  =  impinging  on  the  array.  If  the  wave’s  angle 

of  arrival  relative  to  the  array  is  9 ,  the  signal  observed  at  the 
nth  array  element  is 

mn{t)  =  Aej n  =  o,  1,  AT  -  1  (5) 

where  the  phase  offset  0O  accounts  for  the  absolute  phase  at 
the  first  element.  We  consider  N  samples  formed  from  N  array 
elements  at  a  time  to  and  map  these  N  element  samples  into 
a  vector  form  to  have  a  snapshot  of  the  array  at  a  fixed  time. 

m  =  A\\  1  g-jlndiSinO/Xi  e~j2-n{N -\)disin6  / \i]i 

=  (6) 

where  A\  —  AeAQit+c!,°\  9i  —  diSind/Xi  is  the  normalized 
angle,  (.)'  denotes  the  transpose  operation,  and  as(6i)  is  the 
spatial  steering  vector. 

a5(0;)  =  [l  e~j2lT°'  •••  (7) 

Since  the  target  is  in  motion,  the  normalized  Doppler  shift  at 
the  target  induced  on  the  radar  sensor  i  at  an  angle  9  is 

-  2  v-T 

fdi  =  -r— sin9  =  (36  i  (8) 

where  vl  is  the  velocity  of  the  radar  sensor  i  and  P  =  = 

|d  =Ai-  Each  vector  of  array  outputs  from  successive 
pulses  due  to  the  target  will  have  a  temporal  linear  phase 
progression,  i.e.,  at  the  kth  PRI,  snapshot  of  the  target  takes 
the  form  [9],  [10] 

e{0i  Jdi)  =  ei2Klk-1fi«a,(§i)  k  =  1,  2,  ..,  K  (9) 

If  K  pulses  are  to  be  processed  in  a  coherent  pulse  interval 
(CPI),  the  KN  dimensional  space-time  steering  vector  (snap¬ 
shot)  corresponding  to  a  possible  target  at  look  angle  6  and 
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Doppler  frequency  fdi  is  given  by 

e(6iJdi )  =  h(f  di)®as(8i)  (10) 

where  ®  denotes  the  Knonecker  product  and  bt(fdi)  is  the 
if -dimensional  Doppler  steering  vector 

btQdi)  =  [1  e*2*'"  ...  eJ'2jr(K'-1)7<ii]/  (U) 

By  introducing  the  complex  weighting  vector  Wi,  the  output 
response  of  the  space-time  beamformer  can  be  maximized  for 
any  desired  angle  of  arrival.  More  specifically,  let  xt  and 
denote  the  received  data  at  the  radar  sensor  i  and  beamformer 
output,  respectively. 

Vi  =  w'iXi  (12) 

In  any  case,  the  optimum  weight  vector,  Wi  €  CNK,  that 
maximizes  SINR,  satisfies  the  Weiner-Hopf  equation 

Wi  =  Rje&j*)  (13) 

where  Rdi  e  CNKxNK  is  the  interference-plus-noise  covari¬ 
ance  matrix. 

However,  in  practice,  the  covariance  matrix  Rdi  is  unknown 
and  must  be  estimated.  To  solve  this  problem,  researchers 
have  produced  extensively  algorithms  [12]  -  [15]  to  choose 
an  optimal  set  of  complex  space-time  weights  Wi  in  order  to 
maximize  SINR. 

B.  Interference  analysis  in  Radar  Sensor  Networks 

1)  Clutter:  Clutters  generate  unwanted  radar  returns  that 
may  interfere  with  the  desired  signal.  Parasitic  returns  that 
enter  the  radar  through  the  antenna’s  main-lobe  are  called 
main-lobe  clutter;  otherwise  they  are  called  side-lobe  clutter. 
Clutter  can  be  classified  into  two  main  categories:  surface 
clutter  (including  trees,  vegetation,  ground  terrain,  man-made 
structures,  and  sea  surface)  and  volume  clutter  (including 
chaff,  rain,  birds,  and  insects).  Surface  clutter  changes  from 
one  area  to  another,  while  volume  clutter  may  be  predictable. 
In  many  scenarios,  the  dominant  interference  is  not  noise,  but 
clutter.  Consequently,  the  signal-to-clutter  ratio  (SCR)  is  often 
of  more  important  than  the  signal-to-noise  ratio  (SNR). 

The  integrated  clutter  can  be  generally  approximated  as  the 
sum  of  Nc  elemental  clutter  patches.  For  clutter  patch  i,  the 
space-time  data  vector  [9]  is 

Pi  =  libt(fdci)  ®  as(0ci)  =  HUi  (14) 

where  7 j  is  a  complex  scalar  random  variable  that  accounts 
for  the  amplitude  and  phase  of  the  ith  clutter  patch,  u,  = 
bt{fdci)®as(6ti)-  bt(fdci )  and  «.(**)  are  temporal  vector  and 
spatial  vector  of  clutter  signal  from  clutter  patch  i,  respectively. 
fdci  and  9Ci  are  the  normalized  Doppler  shift  and  angle  of 
arrival  of  the  ith  clutter  patch,  respectively.  The  total  clutter 
vector  equals  to 

Nc  Nc  _  Nc 

Xc  =  YlPi  ~  ®  as(9ci)  =  ^2 liUi  (15) 

i~  1  i= 1  i—  1 

The  covariance  matrix  of  the  clutter  [9]  is  given  by 

Nc  Nc  Nc 

rc  =  e{xcx'c)  =  =  J2  G^i 

i—  1  j— 1  i=l 

(16) 


2)  Jamming :  Jamming  signals  are  generated  by  hostile 
interfering  signal  sources  that  seek  to  degrade  the  performance 
of  radar  sensors  by  mechanisms  such  as  degrading  SINR 
by  increasing  the  noise  level,  or  generating  false  detections 
to  overwhelm  the  radar  with  false  targets.  One  of  the  most 
common  forms  of  jamming  is  a  simple  noise  jammer  that 
radiates  a  relatively  high-power  waveform  at  the  victim  radar 
sensor  from  a  specific  platform.  A  commonly  employed  model 
for  Nj  jamming  signals  [6] 

Ni 

J  zm®aj(8m)  (17) 

m=  1 

where  zm  contains  voltage  samples  of  the  mth  jammer 
waveform  taken  at  PRI.  The  different  jammer  waveforms  are 
uncorrelated  with  each  other. 

3)  Interference  between  radar  sensors:  When  we  deploy 
the  radar  sensor  network,  interference  between  radar  sensors 
need  to  be  studied.  In  [18],  interference  between  radar  sensors 
was  analyzed  in  detail.  Since  radar  sensors  operate  at  different 
frequencies,  they  will  interfere  one  another.  To  suppress  this 
interference,  we  should  choose  waveforms  correctly.  In  this 
paper,  we  choose  the  orthogonality  criterion  to  design  the 
waveforms. 

4)  Noise:  The  echo  signal  received  from  a  target  or  clutter 
inevitably  competes  with  noise.  Noise  can  be  received  through 
antenna  from  external  sources  or  generated  in  the  radar  re¬ 
ceiver  itself.  Of  these  various  sources,  thermal  noise  due  to 
ohmic  losses  at  the  radar  receiver  is  normally  dominant.  We 
can  use  the  Gaussian  process  to  model  the  thermal  noise. 

IV.  Diversity  combining  and  target  detection 

A  radar  sensor  network  is  composed  of  many  radar  sensors 
deployed  in  a  large  geographical  area.  The  radar  sensors 
(nodes)  are  networked  together  in  an  ad-hoc  fashion,  i.e., 
they  do  not  depend  on  any  preexisting  infrastructure.  In 
fact,  they  are  self-organizing  entities  that  are  deployed  on 
demand  to  perform  various  tasks  such  as  surveillance,  disaster 
relief,  search  and  rescue,  etc.  Scalability  concerns  suggest  a 
hierarchical  organization  of  the  radar  sensor  networks  with 
the  lowest  level  in  the  hierarchy  being  a  cluster.  The  clusters 
are  independently  controlled  and  dynamically  reconfigured  as 
nodes  move.  Thus,  this  network  architecture  has  some  main 
advantages  as  follows  [16]  > 

1)  Using  the  radio  resources  efficiently.  For  example,  band¬ 
width  can  be  shared  or  reserved  in  a  controlled  fashion 
in  each  cluster. 

2)  Providing  spatial  and  frequency  reuse  due  to  node  clus¬ 
tering. 

3)  Robustness  with  topological  changes  caused  by  node 
motion,  node  failure,  and  node  insertion/removal. 

4)  Concealing  the  details  of  global  network  topology  from 
individual  nodes. 

In  this  paper,  we  perform  the  waveform  design  for  radar 
sensors  in  the  network.  Each  sensor  in  the  network  will  be 
assigned  a  waveform  with  specific  parameters.  Radar  sensors 
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Fig.  2.  Receiver  at  the  clusterhead  for  diversity  combining 


can  provide  their  parameters  about  waveforms  to  the  cluster- 
head.  Cluster-head  will  collect  and  combine  waveforms  from 
cluster  members. 

The  received  data  at  each  radar  sensor  i  consists  of  the 
desired  signal  and  interference,  i.e., 

Xi(u,t)  =  ai(u)e(Oi,fdi)si(t-Ti)  +  Ci  +  Ji  +  Ii+ni 
=  ai(u)e(9iJdi)si(t-Ti)  +  di  (18) 

where  dd  =  C\  +  Jt  +  /;  +  n,  presents  the  overall  interference, 
i.e.,  the  sum  of  the  clutter  vector  C2,  the  jammer  vector  Jlt 
the  interference  between  radar  sensors  and  the  background 
white  noise  rtj.  cti(u)  is  a  random  variable  that  models  the 
radar  cross  section  (RCS),  e(<7,/di)  is  a  spatial-temporal 
steering  vector  that  models  the  target  return  for  a  specific 
angle-Doppler,  and  s,(f  -  n)  is  the  return  of  waveform  with 
delay  t*. 

The  data  at  the  output  of  the  ith  sensor  is  the  multiplication 
of  the  received  data  Xi(u,t)  and  the  spatial-temporal  weight 
vector  Wi,  i.e., 

yi{u,t)  =  WiXi(u,t) 

=  w'i[ai{u)e{6iJdi)si(t-Ti)  +  di] 

=  ai(u)si{t  -  Ti)Li(9iJdi)  +  Di  (19) 

where  A  =  and  A(07,7di)  =  ^e(7i,7di). 

Assuming  the  radar  sensor  network  with  M  radar  sensors, 
the  received  signal  r(u ,  t)  at  the  cluster-head  is 

M 

r(u,t)  = 

2=1 

M 

=  53Wu)s*(£  -Ti)Li(6iJdi)  +  Di}  (20) 
2  =  1 

Note  that  cti(u)  can  be  modeled  using  non-zero  constants  for 
non-fluctuating  targets  and  four  Swerling  target  models  for 
fluctuating  targets  [17],  Target  fluctuation  lowers  the  probabil¬ 
ity  of  detection,  or  equivalently  reduces  SINR. 

At  the  cluster-head,  we  propose  a  receiver  [18]  as  shown  in 
the  Fig.  2  to  combine  waveforms.  According  to  this  receiver, 
the  received  signal  r(u,  t)  is  processed  by  a  bank  of  matched 
filters.  After  integration,  the  output  of  the  branch  1  is  given 
by 


Zi(u)  =  I J  r(u,t)s*(t  —  T\)dt\ 

M  _  _  fT 

=  \Y ]ai(u)Li(6i,fdi)  /  Si(t  -  Ti)sl{t  -  n)dt 

i= i  Jo 

M  ~T 

+  'V)  s\{t-Ti)Didt\ 

i=  i  Jo 

=  \Zn(u)  +  Z12(u)\  (22) 


where  Zn(u) 

and  Z12(u)  are  defined  as  below 

N) 

7? 

II  > 

M  .T 

'Y'ai{u)Li{ei,fdd)  /  Si{t  — 
Jo 

TiK(t 

—  T\)dt 

2=1 

M  ~T 

= 

y'ai(u)Li(0i,fdi)  /  Si(t- 

Ti)sl(t 

-  T\)dt 

+ 

l—Z 

Eai(u)Li(6iJdl) 

(23) 

Zl2(u)  = 

M  -T 

T  /  sl(t-Ti)Didt 

(24) 

Based  on  (4), 

Z\  i(u)  can  be  rewritten  as 

Z\\{u)  —  Eai{u)Li{0i,  f  dl) 

(25) 

Assuming 

that  waveforms  are  designed  properly. 

So,  the 

interference  between  radar  sensors  is  negligible.  Since  the 
detection  performance  of  RSN  is  greatly  affected  by  the  clutter, 
we  consider  the  clutter  the  primary  interference  source.  The 
overall  interference  can  be  given  by 

Nc 

di  —  Ci{f  dci  i  Oci)  +  Tli  =  7  ]  'l/ij'U'ij  "b  TH  (26) 

J  =  1 

where  mj  =  bt{f  dci.)  ®as(0cy).  Thus,  Z 12 (w)  becomes 

M  pT 

Zn{u)  =  y^  w'idiS*i(t  -  n)dt  (27) 

7=1  Jo 

Since  7 y  is  a  complex  random  variable,  we  assume  is  a 
complex  Gaussian  random  variable.  So,  it  is  not  difficult  to 
prove  that  Z 12  is  a  complex  Gaussian  noise  n(u).  The  output 
of  the  branch  1  becomes 

Zi{v)  «  |£?ai(«)L1(fl1J(ii)  +  r*(«)|  (28) 

Similarly,  the  output  of  the  ith  branch  (i  =  1,2,  ...,  M)  is 

Ziiu)  «  \Eai{u)Li(9iJdi)  +  n{u)\  (29) 

Based  on  (29),  we  can  recognize  that  the  output  of  the  ith 
branch  is  composed  of  the  signal  from  the  radar  sensor  i  and 
noise.  Note  that  when  we  compute  Zi{u),  we  still  have  to 
estimate  the  interference-plus-noise  covariance  matrix. 

Now,  our  objective  is  to  combine  all  the  outputs  of  all 
branches.  We  will  use  diversity  combining  method  to  combine 
these  outputs.  In  this  paper,  we  are  interested  in  using  the 
spatial-temporal-frequency  diversity  in  RSN  to  solve  the  target 
detection  problem.  The  purpose  of  detection  problem  is  to 
figure  out  the  presence  and  motion  of  the  desired  targets  such 
as  missiles,  tanks,  fighter  aircrafts,  other  tactical  weapons  from 
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the  enemy,  illegal  intruders  at  the  border  of  the  country,  over¬ 
speeded  vehicles  or  strange  ships  at  sea,  etc.  In  this  paper,  we 
will  apply  the  spatial-temporal-frequency  diversity  to  improve 
the  detection  performance  and  use  a  maximum  likelihood 
criterion  to  analyze  the  detection  performance  of  RSN  for  both 
non-fluctuating  targets  and  fluctuating  targets.  The  detection 
problem  in  RSN  can  be  formulated  as  follows: 

Hq  :  Target  is  not  present 

H i  :  Target  is  present  (30) 


A.  Non-fluctuating  targets 

Non-fluctuating  targets  can  be  modeled  as  the  Swerling 
0  or  equivalently  Swerling  V  [11],  [17].  The  radar  cross 
section  (RCS)  q,(w)  of  non-fluctuating  targets  is  constant 
and  unknown.  We  assume  that  ai(it)  =  a2{u)  =  •••  = 
aM(w)  =  a(u).  Under  hypothesis  Hq,  Zflu)  follows  the 
Rayleigh  distribution.  The  probability  density  function  (pdf) 
of  Zi  is 

2  z-  z2 

f{zi\H0)  =  —±exp[—^]  (31) 


Under  hypothesis  H i,  Zflu)  follows  the  Rician  distribution. 
The  pdf  of  Zi  with  the  parameter  m*  is 


IMHO  = 


+  (32) 


B.  Fluctuating  targets 

In  practice,  RCS  is  normally  fluctuating.  Based  on  different 
combinations  of  pdf  and  decorrelation  (pulse  to  pulse  or  scan 
to  scan),  Swerling  [17]  proposed  four  Swerling  models.  He 
also  showed  that  the  statistics  associated  with  Swerling  I  and 
II  models  are  applied  to  targets  consisting  of  many  small 
RCS  scatters  of  comparable  RCS  values,  while  the  statistics 
associated  with  Swerling  III  and  IV  models  are  applied  to 
targets  consisting  of  one  large  scatter  and  many  small  equal 
RCS  scatters  [11], 

In  this  paper,  we  focus  on  the  Swerling  II  model.  The  mag¬ 
nitude  |a(u)|  of  Swerling  II  targets  fluctuates  independently 
from  pulse  to  pulse  according  to  a  chi-square  probability 
density  function  with  two  degree  of  freedom,  i.e.,  a  Rayleigh 
probability  density  function. 

a(u)  =  ai(u)  +  jaQ(u)  (37) 

where  Qj(u)  and  ag(tt)  follow  Gaussian  distribution  with  the 
variance  p2/ 2  for  each  branch  I,  Q.  Under  hypothesis  Hq, 
Zi(u)  follows  the  Rayleigh  distribution.  The  pdf  of  Zflu)  is 

2z-  z2 

f(zi\H0)  =  ~exp[-^}  (38) 

Under  hypothesis  Hi,  Zflu)  follows  the  Rayleigh  distribution. 
The  pdf  of  Zi(u)  is  given  by 


where  m,  =  Ea(u)Li(6i,  fdi),  a2/2  is  the  noise  power  for 
each  branch  I,  Q,  and  J0(.)  is  the  zero-order  modified  Bessel 
function  of  the  first  kind.  We  assume  that  Z\,Z2,  ...,Zm  are 
independent  random  variables.  Let  Z  =  [Zi,  Z2, ZM],  the 
joint  pdf  of  the  variable  Z  for  each  hypothesis: 


f(z\H0) 


i= 1 
M 


n  ^expl 


zf  +  m3 


(33) 


Ko(^)  (34) 


Our  objective  is  to  decide  whether  or  not  a  target  is  present 
based  on  the  received  signal  at  the  cluster-head.  Using  the 
maximum  likelihood  criterion,  we  can  derive  the  detection 
threshold  T,  false  alarm  probability  Pfa  and  miss-detection 
probability  Pmd- 


r  oo  zm-i 

Ffa  =  JT 

=  <35) 

pOO  _ 

Pmd  =  1-/ 

JT  Mx 

(36) 


where  x  is  the  average  signal-to-clutter-plus-noise  ratio 
(SCNR),  Jm-i(-)  is  the  (M  —  1)  order  modified  Bessel 
function  of  the  first  kind,  and  I  (a,  b)  is  the  incomplete  gamma 
function. 


f(zi\Hi)  =  ^exp(-^)  (39) 

where  cr,  =  .  We  assume  that 

Zi,  Z2,  •••,  Zm  are  independent  random  variables.  Let  Z  = 
[Zi,Z2,--,ZM],  the  joint  pdf  of  the  variable  Z  for  each 
hypothesis: 


f(z\H0) 

fWi) 


M 


1  =  1 

M  2zi  2 


n 


cp[- 


(40) 

(41) 


Our  objective  is  to  decide  whether  or  not  a  target  is  present 
based  on  the  received  signal  at  the  cluster-head.  Using  the 
maximum  likelihood  criterion,  we  can  derive  the  detection 
threshold  T,  false  alarm  probability  Pfa  and  miss-detection 
probability  Pmd- 


/•oo 

=  jT  W^ye~‘dz 

-  1-7(^'M-1)  <42> 
r  oo  M- 1 

p«°  ~  l-JT  (i dzm 


where  x  is  the  average  SCNR,  and  I  (a,  b )  is  the  incomplete 
gamma  function. 
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V.  Simulation  results  and  performance  analysis 
A.  Simulated  data  model 

In  this  paper,  we  use  the  modified  Joint  Domain  Localized 
(JDL)  algorithm  proposed  by  Adve  et.al  [12]  to  determine 
the  space-time  weights  at  sensors.  The  data  generation  scheme 
uses  the  physical  model  presented  by  Ward  [8], 

As  mentioned  in  the  section  III,  the  clutter  is  modeled 
as  a  sum  of  the  contributions  of  many  discrete  far  field 
sources.  In  this  paper,  amplitude  of  each  discrete  source  is 
a  complex  Gaussian  random  variable  whose  average  power  is 
set  by  a  chosen  clutter-to-noise  ratio  (CNR).  The  normalized 
Doppler  shift  associated  with  each  clutter  source  depends  on 
the  velocity  of  the  platform. 

Thermal  noise  is  modeled  as  a  Gaussian  white  noise  pro¬ 
cess.  The  average  power  is  set  to  unity  allowing  the  clutter  and 
target  powers  to  be  referenced  to  the  white  noise  power.  Sim¬ 
ulations  do  not  consider  the  effects  of  Jammers.  Parameters 
used  in  simulations  are  listed  in  the  Table  I  [12], 

TABLE  I  -  Parameters  used  in  simulations 


Parameters 

Values 

Array  elements 

8 

Pulses 

8 

Element  spacing 

Ai/2 

Pulse  Repetition  Frequency  (PRF) 

1024  Hz 

The  number  of  clutter  sources 

181 

Target  normalized  Doppler  shift 

1/3 

Thermal  noise  power 

Unity 

Clutter  to  noise  ratio  (CNR) 

50  dB 

The  number  of  Doppler  bins  in  LPR 

3 

The  number  of  Angle  bins  in  LPR 
(LPR:  Local  Processing  Region) 

3 

The  interference-plus-noise  covariance  matrix  Rd  is  given 

Rd  =  Rn  +  Pc{k)Rc  (44) 

where  R n  is  the  covariance  of  noise,  Rc  is  the  clutter 
covariance  computed  in  (16),  and  pc(k)  is  a  random  variable 
used  to  model  the  clutter  power  of  the  kth  range  cell.  pc(k) 
often  follows  Weibull  or  gamma  distribution  [19]  [20],  In 
homogeneous  environments,  the  average  clutter  power  does 
not  depend  on  k.  We  assume  the  average  CNR  equals  50  dB. 

B.  Detection  performance  analysis  of  the  radar  sensor  net¬ 
work 

In  RSN,  each  radar  sensor  transmits  a  known  waveform. 
This  waveform  is  reflected  back  from  the  target  toward  the 
receiving  sensor.  RSN’s  tasks  are  to  detect  the  existence  of 
the  target  and  to  estimate  its  unknown  parameters,  e.g.,  range 
speed  and  direction.  In  this  paper,  we  use  the  spatial-temporal- 
frequency  diversity  in  RSN  to  improve  the  detection  perfor¬ 
mance.  Fig.  3(a)  presents  the  probability  of  miss-detection 
Pmd  as  a  function  of  SCNR  and  Pfa  =  10~6  while  Fig.  3(b) 
presents  the  miss-detection  probability  Pmd  as  the  function  of 
false  alarm  probability  Pfa >  SCNR  =  8dB  and  SCNR  = 
10 dB  for  non-fluctuating  targets.  Similarly,  Fig.  4(a)  presents 


the  probability  of  miss-detection  Pmd  as  a  function  of  SCNR 
and  Pfa  =  10-6  while  Fig.  4(b)  presents  the  miss-detection 
probability  Pmd  as  the  function  of  false  alarm  probability 
PFA,  SCNR  =  10 dB  and  SCNR  =  12 dB  for  fluctuating 
targets. 


Fig.  3.  Non-fluctuating  target  models:  (a)  Probability  of  miss-detection 
Pmd  as  a  function  of  SCNR,  Pfa  —  10  6,  (b)  Miss-detection  probability 
Pmd  as  the  function  of  false  alarm  probability  Pfa-  SCNR.  =  8 dB  and 
SCNR  =  10  dB 

Based  on  these  results,  we  recognize  that  the  probability 
of  miss-detection  Pmd  at  the  same  SCNR  decreases  when 
the  number  of  radar  sensors  in  the  network  increases,  e.g.,  at 
SCNR=10  dB,  Pmd  of  the  5-radar  RSN  is  much  lower  than 
that  of  the  2-radar  RSN.  It  is  desirable  for  Pmd  to  be  as  low  as 
possible.  In  the  real  world,  Pmd  less  than  10%  is  reasonable. 
We  can  observe  that  it  is  very  difficult  to  achieve  this  Pmd 
with  a  single  radar  at  low  Pfa  and  if  possible,  the  SCNR 
must  be  very  high.  However,  the  5-radar  RSN  can  maintain 
very  low  Pmd  at  a  low  Pfa- 

We  also  notice  that  it  requires  more  SCNR  with  fluctuating 
targets  than  with  non-fluctuating  targets  to  achieve  the  same 
Pmd-  For  example,  when  we  use  5  radar  sensors  and  Pmd  is 
about  10%,  SCNR  is  9.3  dB  for  fluctuating  targets  but  less  than 
9  dB  for  non-fluctuating  targets.  At  the  same  values  of  Pmd 
and  SCNR,  Pfa  for  non-fluctuating  targets  is  lower  than  for 
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(a) 


rFA 

(b) 


Fig.  4.  Fluctuating  target  models:  (a)  Probability  of  miss-detection  Pm  d  as 
a  function  of  SCNR,  Pfa  =  10~6,  (b)  Miss-detection  probability  Pmd  as 
the  function  of  false  alarm  probability  Pfa>  SCNR  —  10 dB  and  SCNR  = 
12  dB 

fluctuating  targets.  So,  depending  on  specific  scenarios,  we  can 
choose  Pfa  and  SCNR  logically  in  order  to  get  the  desired 
detection  performance. 

VI.  Conclusion 

In  this  paper,  we  investigated  and  applied  the  spatial- 
temporal-frequency  diversity  to  improve  the  detection  perfor¬ 
mance  of  RSN.  We  also  proposed  a  receiver  for  diversity 
combining  in  RSN.  The  probability  of  miss-detection  as  a 
function  of  the  false  alarm  probability  and  the  signal-to-clutter- 
plus-noise  (SCNR)  is  analyzed  for  both  non-fluctuating  targets 
and  fluctuating  targets.  Simulation  results  showed  that  the 
detection  performance  of  our  diversity  scheme-based  radar 
sensor  network  is  much  better  than  that  of  single  radar  system 
using  the  spatial-temporal  diversity  only. 

However,  in  this  paper,  we  do  not  consider  other  factors  that 
affect  the  performance  of  RSN  such  as  crab  angle,  mutual 
coupling  and  beam  mismatch  between  target  and  steering 
vector.  Although  we  also  study  interference  in  nonhomoge- 
neous  environments,  we  only  limit  our  simulations  to  the 
homogeneous  environments.  In  subsequent  papers,  we  will 


investigate  methods  to  solve  the  above  restrictions  and  de¬ 
velop  our  scheme  to  solve  advanced  problems  in  radar  sensor 
networks:  target  search  and  target  recognition. 
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Abstract — Virtual  multiple-input  multiple-output  (MIMO) 
communication  architecture  is  popularly  used  in  Wireless 
Sensor  Networks  (WSN)  recently  to  counteract  the  severe 
communication.  Channel  state  information  (CSI)  is  a  crucial 
factor  to  MIMO  communication  system.  Waterfilling  strategy 
based  on  the  CSI  at  transmitter  (CSIT)  is  an  optimal 
power  allocation  scheme,  especially  at  low  signal-to-noise 
ratio  (SNR)  scenario.  However,  CSIT  is  usually  imperfect  in 
practice.  In  this  paper,  we  will  analyze  the  how  the  estimation 
error  would  impact  the  channel  capacity  for  the  MIMO 
system  at  low  SNR  region. 

Index  Terms — Wireless  sensor  networks,  channel  capacity, 
channel  state  information,  estimation  error. 

I.  Introduction 

The  infusion  and  maturation  of  the  Micro  Mechan¬ 
ical  System(MEMS),  computations,  and  wireless  com¬ 
munication  technologies  has  advanced  the  development 
of  Wireless  Sensor  Networks  (WSN).  In  WSN,  a  large 
amount  of  low  cost  sensor  nodes  are  densely  deployed  to 
monitor  the  environment  of  interest.  Due  to  the  various 
applications  [7]  [16],  WSN  has  generated  flurry  of  research 
activity. 

The  sensor  nodes  are  miniature  devices  equipped  with 
a  sensor,  a  transceiver  and  the  necessary  electronic  cir- 
cuites  and  able  to  collect  and  forward  information[10]. 
These  nodes  are  designed  to  be  deployed  randomly  or 
strategically  in  area  and  left  operating  until  their  battery 
is  totally  used  up.  Obviously,  WSN  is  power,  bandwidth, 
and  complexity  limited. 

Virtual  multiple-input  multiple-output  (MIMO)  commu¬ 
nication  architecture  have  recently  been  applied  in  energy- 
constraint,  distributed  WSN  so  as  to  economize  energy 
consumption  [5]  [14]  [15].  In  these  implementations,  the 
underlying  MIMO  concepts  include  the  simple  Alamouti 
scheme  [1]  and  the  virtual  Bell  Labs  Layered  Space-time 
(V-BLAST)  architecture  [23], 

In  addition  to  the  potential  advantage  of  energy  effi¬ 
ciency  in  MIMO  ,  the  theoretical  capacity  gain  of  MIMO 
channels  is  also  enormous.  It  has  been  proved  that  MIMO 
capacity  nominally  increases  linearly  with  the  number 


of  antenna  elements.  However,  MIMO  channel  capacity 
depends  heavily  on  how  accurately  we  know  the  channel. 
Specifically,  when  the  instantaneous  channel  gains,  called 
channel  state  information  (CSI),  are  known  perfectly  at 
both  the  transmitter  and  the  receiver  of  the  MIMO  system, 
the  transmitter  can  adapt  its  transmission  strategy  relative 
to  the  instantaneous  channel  state,  so  the  maximum  MIMO 
channel  capacity  can  be  achieved. 

Channel  capacity  for  MIMO  channel  has  been  inten¬ 
sively  studied  in  different  scenarios  since  the  pioneer 
work  by  Winters  [18]  and  Telatar  [17].  Most  of  them 
modeled  the  channel  as  fully  scattering,  i.e.,  Rayleigh 
fading  channel.  The  Rayleigh  fading  model  is  a  reasonable 
assumption  for  many  fading  environments  encountered  in 
practical  communication  systems.  However  in  some  other 
cases,  (e.g.,  in  WLAN  application,  and  for  sure,  WSN 
fits  in  this  scope),  there  is  a  strong  deterministic  Line- 
Of-Sight  (LOS)  component  between  the  transmitter  and 
receiver,  which  gives  rise  to  Rician  fading  model.  The 
Rician  distribution  is  characterized  by  the  Rice  factor, 
k,  which  reflects  the  relative  strength  of  the  direct  LOS 
path  component.  When  k  =  0,  this  model  reduces  to 
Rayleigh  fading  and  k  — >  oo  the  model  reduces  to  AWGN 
channel.  The  capacity  of  a  Rician  channel  with  receiver 
CSI  but  without  any  knowledge  even  the  Rice  factor  at 
the  transmitter  has  been  studied  in  [8]  [6].  [13]  assumes 
that  there  is  perfect  CSI  at  receiver  (CS1R),  meanwhile  the 
transmitter  has  partial  CSI,  i.e.,  the  knowledge  of  the  value 
of  the  Rice  factor. 

In  this  paper,  we  assume  that  the  transmitter  has 
full/partial  CSI,  however,  there  is  error  of  the  CSI.  We 
will  investigate  how  the  error  impact  the  channel  capacity. 

The  remainder  of  this  paper  is  organized  as  follows. 
In  section  II  we  briefly  present  the  relay  virtual  MIMO 
architecture  for  WSN  and  the  underlying  one-hop  MIMO 
system  model.  Whereafter,  we  investigate  the  channel 
capacity  with  imperfect  CSI  in  sectionlll.  The  numerical 
results  will  be  presented  in  sectionIV.  And  we  make  the 
conclusion  and  propose  the  future  work  in  sectionV. 


II.  Virtual  MIMO  architecture  and  System 
Model 


A.  Virtual  MIMO  architecture 

In  this  section,  we  will  first  briefly  review  the  virtual 
MIMO  architecture  applied  in  WSN. 

The  nodes  in  a  WSN  is  usually  of  small  dimensions. 
Thus  it  may  not  be  realistic  to  assume  these  inexpensive 
sensor  nodes  are  equipped  with  multiple  antennas.  [5]  first 
proposed  to  realize  MIMO  communication  architecture  in  a 
WSN  consisting  of  single-antenna  sensor  nodes  via  sensor 
cooperation.  The  MIMO  architecture  applied  here  is  shown 
in  Fig.  1 

WIRELESS  SENSOR  NETWORK 
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Fig.  1.  Virtual  MIMO  architecture  in  WSN 

There  is  a  transmitter  cluster,  which  is  composed  of 
a  group  of  Nt  cooperative  sensor  nodes,  communicating 
with  a  receiver  cluster,  which  is  compose  of  a  set  of  Nr 
cooperative  sensor  nodes.  In  order  to  fulfil  the  communi¬ 
cation,  the  transmitter  cluster  should  do: 

•  broadcasting  the  data  within  the  cluster,  so  that  all  the 
active  sensor  nodes  inside  the  cluster  can  encode  and 
send  out  the  data  during  the  MIMO  transmission; 

•  transmitting  the  data  via  the  Nt*Nr  MIMO  channel. 

These  two  functionalities  should  be  carried  out  in  two 

orthogonal  channel.  Here,  we  assume  time  division  (TD) 
scheme  is  applied.  These  two  TD  channels  are  referred 
as  Intracluster  channel  and  Intercluster  channel.  For  Intr¬ 
acluster  channel,  it  is  falling  into  the  broadcast  capacity 
region.  In  the  Intercluster,  the  Nt  sensor  nodes  jointly 
transmit  data  to  the  receiver  cluster  by  using  a  Nt  *  Nr 
virtual  MIMO  channel.  So,  we  will  mainly  focus  on  the 
Intercluster  MIMO  channel. 

B,  System  model 

In  this  paper,  we  only  consider  single  user  scenario.  Nt 
transmitter  sensor  nodes  and  Nr  receiver  sensor  nodes 
equal  to  Nt  transmitting  antenna  and  Nr  receiving  an¬ 
tenna  respectively.  Throughout,  we  assume  that  Nr  >  Nt- 
The  discrete-time  received  signal  in  such  a  system  can  be 
modeled  in  matrix  form  as 

y(*)  =  H(i)x(i)  +  77(1)  (1) 

where  at  symbol  time  i,  x(i)  is  the  Nt  x  1  vector  of 
transmitted  signal  on  the  NT  transmitting  sensor  nodes, 


y(i)  is  the  Nr  x  1  vector  of  received  signal  on  the 
Nr  receiving  sensor  nodes,  and  7 ](i)  is  the  Nr  x  1 
additive  receiver  noise  vector.  We  assume  the  components 
of  the  noise  77(f)  to  be  independent,  zero-mean,  circularly 
symmetric  complex  Gaussian  with  independent  real  and 
imaginary  parts  having  equal  variances.  The  noise  is  also 
assumed  to  be  independent  with  respect  to  the  time  index, 
and  Rrj  =  £[77(7)77(7)"]  =  crpNn,  where  INr  €  CNrXNr 
denotes  the  identity  matrix,  and  AH  denotes  the  hermitian 
conjugate  of  the  matrix  A. 

The  CSI  is  the  channel  matrix  H(i)  e  £NrxNt.  The 
(771,  n)-th  element  of  the  matrix  H(i)  represents  the  fading 
coefficient  value  at  time  i  between  the  772-th  receiving 
sensor  node  to  the  71-th  transmitting  sensor  node.  The 
H(i)  corresponding  to  each  channel  use  is  assumed  to  be 
independent  from  that  of  other  uses,  i.e.,  H(i)  and  H(j) 
are  independent  when  i  ^  j.  In  this  sense,  the  time  index 
f  can  be  dropped  so  as  to  simplify  notation. 

In  Rician  fading  the  elements  of  H  are  nonzero  mean, 
complex  Gaussian  variables.  Hence,  we  can  model  the 
channel  matrix  H  as  a  sum  of  two  components,  a  fixed 
(LOS)  component  and  a  variable  (or  scattered)  component, 

H  =  H  +  H  (2) 

and 

H  =  £{H} 

=  -^=(1  +i)^Nn,NT  (3) 

where  At nr,nt  is  defined  as  the  Nr  x  Nt  matrix  of 
all  ones.  E{  }  is  an  expectation  operation.  H  is  a  com¬ 
plex  distributed  matrix  with  zero-mean,  denoted  as  H  ~ 
Nc(0,1nt  ®£),  with  the  probability  density  function 
(pdf)  [4]  [12] 

f~  (H)  =  _ _ — _ e -tr{T.~lHHH]  m 

^NTNn^NTe  ’  W 

where  tr  denotes  the  trace  of  a  matrix,  E  is  the  Hermitian 
covariance  matrix  of  the_columns  (assumed  to  be  the  same 
for  all  the  columns)  of  H, 

E  =  2  ex2  IjvR .  (5) 

The  strength  of  the  LOS  component  can  be  indicated  by 
the  Rician  K  factor, 

Id2 

K=  10  log10(!£-)dB.  (6) 

In  the  case  of  Rayleigh  fading,  which  is  an  extreme 
scenario  of  Rician  fading,  K  =  0,  which  also  implies  that 
H  =  0.  The  parameters  should  be  normalized  as 

Im|2  +  2  cr2  =  1,  (7) 

so  that  the  signal-to-noise  ratio  (SNR)  will  not  be  scaled 
by  the  channel.  We  also  assume  that  the  channel  is  block 


fading  [3],  i.e.,  H  remains  constant  over  T  >  Nt  symbol 
periods  and  changes  in  an  independent  fashion  from  block 
to  block. 

III.  One  Hop  Channel  Capacity  of  MIMO  Rician 

FADING  CHANNEL 

The  CSI  is  the  channel  matrix  H.  Based  on  different 
knowledge  of  the  CSI,  we  can  get  different  system  perfor¬ 
mance,  including  channel  capacity. 

A.  Error  Model 

In  a  realistic  scenario,  however,  the  CSI  is  generally 
imperfect.  The  receiving  sensor  node  can  estimate  the  CSI, 
i.e.,  the  matrix  H,  using  training  sequences,  e.g.,  pilot 
symbols.  For  CSIT,  basically  there  are  two  ways  [2], 

•  The  transmitting  sensor  nodes  can  estimate  the  chan¬ 
nel  using  the  signals  received  in  the  reverse  link,  and 
use  it  as  an  estimate  in  the  forward  link,  because  of  the 
channel  reciprocity  principle.  During  the  procedure 
of  estimation,  estimation  error  will  be  introduced 
without  doubt.  Suppose  the  estimation  is  unbiased, 
the  estimated  CSI,  H  can  be  formulated  as 

H  =  H  +  £  (8) 

where  £  is  the  estimation  error.  The  entries  of  £ 
are  assumed  to  be  i.i.d.  and  zero-mean  circularly 
symmetric  complex  Gaussian.  £  is  independent  from 
the  real  channel  realization. 

•  The  transmitter  can  obtain  the  CSI  through  a  feedback 
channel  from  the  receiver  to  the  transmitter.  Besides 
the  Gaussian  estimation  error  at  the  receiver,  due 
to  the  finite  capacity  of  the  feedback  channel,  the 
channel  response  has  to  be  quantized,  which  will 
introduce  the  another  quantization  noise  to  the  CSIT. 
The  receiving  sensor  nodes  can  uniformly  quantize  the 
real  and  imaginary  parts  of  all  the  entries  of  the  CSI 
matrix  H.  Suppose  the  quantization  step  size  is  Ag, 
assume  that  there  will  be  no  error  in  the  transmission, 
the  quantization  signal-to-noise  ratio  SNRq  is  given 
in  [19]  as, 

SNRq  =  (9) 

where  a\  is  the  variance  of  each  component  of  H. 
Notice  that  when  the  Aq  is  sufficiently  small,  the 
quantization  SNRq  will  be  very  large. 

We  will  focus  on  the  Gaussian  estimation  error  in  the  fol¬ 
lowing  analysis.  And  we  make  the  following  assumption. 

•  The  transmitting  sensor  nodes  have  a  total  power 
constraint  P,  however,  they  can  adapt  their  power 
allocation  according  to  the  channel  fading  so  as  the 


maximize  capacity.  The  power  constraint  condition 
can  be  mathematically  presented  as, 

E(P)  =  E(tr(Q))  <  P.  (10) 

•  Either  there  exist  a  perfect  and  instantaneous  feedback 
channel  from  the  receiver  to  the  transmitter,  or  the 
delay  from  the  obtaining  reverse-channel  information 
to  applying  the  information  to  the  forward-link  is 
negligible.  So  the  only  error  we  consider  is  the 
estimation  error  in  (8). 

B.  Channel  Capacity  with  Perfect  CSIR  and  CSIT 

If  CSI  is  known  both  at  the  transmitting  and  receiving 
sensor  nodes,  the  transmitting  sensor  nodes  can  adapt  its 
transmission  strategy  according  to  this  CSI.  The  channel 
capacity  of  such  a  MIMO  system  with  perfect  CSIR  and 
CSIT  is  nothing  but  the  average  of  the  capacities  achieved 
with  each  channel  realization.  The  formula  is  given  in  [9] 

C  =  Eh[  max  log|INR  +  HQHH|]  (11) 

Q:tr(Q)  =  P 

where  Q  is  the  input  covariance  matrix  as 

Q  =  E(xxh).  (12) 

C.  Optimal  power  allocation  strategy:  waterfilling 

With  CSI  at  both  the  receiver  and  the  transmitter, 
optimal  power  allocation  strategy  based  on  the  H  should 
be  applied  to  maximize  the  channel  capacity  in  (11)  [22]. 
The  joint  singular  value  decomposition  (SVD)  and  water¬ 
filling  power  allocation  technique  provides  the  optimum 
solution  [20], 

The  SVD  of  the  channel  matrix  H  is  presented  as 

H  =  UAVh  (13) 

where  U  G  CNrxNr  is  an  unitary  matrix  of  orthonormal- 
ized  eigenvector  of  HHT,  V  €  CNtXNt  is  an  unitary 
matrix  of  orthonormalized  eigenvector  of  HTH,  and  A  e 
£NRxNT  js  a  rectangular  matrix  whose  diagonal  elements 
are  non-negative  real  numbers  and  whose  off-diagonal  el¬ 
ements  are  zero.  The  diagonal  elements  Ai,  A2, . . . ,  ^Nmin 
denotes  the  ith  singular  value  of  H,  and  Aj  >  A2  >  ■  •  •  > 
A Nmin  >  0,  where  Nmin  =  min(NT,NR). 

The  MIMO  channel  can  then  be  represented  as  a  parallel 
channel  [22]  based  on  the  ordered  singular  value  A ;  as 

Vi  —  A i%i  “F  Iji,  i  —  1,  ■  *  ‘  j  Nmin  (14) 

and 

x  =  VKx, 

y  =  UHy, 

rj  =  XJht]. 

Substituting  (13)  into  (11),  we  can  get 


Nmin  X2 

C  X> +  <15> 

The  transmitting  powers  are  allocated  to  each  sub-channel 
based  on  their  strength  according  to  the  waterfilling  strat¬ 
egy, 

5  =  (M-^)+  (16) 

where  (A)+  =  max(0,  A),  and  p  is  the  waterfilling  level 
which  should  be  chosen  so  that  the  total  power  constraint 
is  satisfied: 

E(^-4i)  =  i-  07) 

<=1  « 

Therefore,  the  maximum  channel  capacity  can  be  ob¬ 
tained  as: 

Cmax=Y,  l0g(l  +  ^)  (18) 

t,Pi>0  ^ 

D.  Channel  Capacity  with  imperfect  CSIT 

When  there  is  error  in  the  estimation  as  discussed  in 
section  III-A,  what  the  transmitter  knows  is  the  estimated 
channel  matrix  H  in  (8).  Hereby,  the  decomposition  of  the 
MIMO  channel  and  the  application  of  waterfilling  strategy 
in  section  III-C  are  both  based  on  H,  which  contains  zero- 
mean  Gaussian  Noise.  Apparently,  the  singular  value  A, 
obtained  from  H  will  be  different  from  A;.  Consequently, 
the  maximum  channel  capacity  in  (18)  is  no  long  tenable 
because  the  power  allocation  is  not  optimized,  we  will 
show  how  the  estimation  error  impact  the  channel  capacity 
in  section  IV. 

IV.  Numerical  Results  and  Discussions 

Numerical  results  are  presented  to  show  how  Gaussian 
estimation  errors  degrade  the  channel  capacity  in  i.i.d. 
Rician  fading  MIMO  channel  based  on  Monte-Carlo  sim¬ 
ulations. 

Two  MIMO  systems,  where  Nt  =  Nr  =  3  and 
Nr  =  Nr  =  6,  respectively  were  simulated.  The  MIMO 
channels  were  treated  as  9  and  36  i.i.d.  single-input- 
single-output  (SISO)  Rician  channels  applying  the  Jakes’s 
model  [21]  [11], 

In  Figure  2,  we  compare  the  average  channel  capacity 
using  equal  power  allocation,  waterfilling  strategy  with 
perfect  CSIT,  and  waterfilling  strategy  with  channel  es¬ 
timation  error  as  <r|  =  0.1,0.25,0.5,1  respectively  in  a 
Nt  =  3,N/{  =  3  MIMO  system  with  the  Rician  fading 
parameter  as  if  =  10  for  all  the  independent  3x3  =  9 
sub-channels,  and  f,i  is  randomly  chosen  from  0  ~  30 Hz 
for  sub-channels. 


Fig.  2.  Average  Channel  Capacity  for  Nt  =  Nr  =  3  in  Rician  fading 
MIMO  system  with  K  =  10. 


We  can  see  clearly  that  the  waterfilling  scheme  performs 
much  better  than  the  equal  power  allocation  scheme,  which 
does  not  need  CSIT,  even  with  estimation  error,  especially 
at  the  low  SNR.  At  high  SNR,  the  difference  between 
waterfilling  and  equal  power  allocation  scheme  becomes 
slimmer. 

We  then  set  the  Rice  factor  K  =  0  for  all  the  sub¬ 
channels,  which  means  it  is  Rayleigh  fading.  The  results 
are  shown  in  Figure  3. 


Fig.  3.  Average  Channel  Capacity  for  Nt  =  Nr  =  3  in  Rayleigh 
fading  MIMO  system. 

For  Rayleigh  fading  MIMO  channel,  channel  capacity 
is  more  sensitive  to  the  SNR,  but  less  sensitive  to  the 
estimation  error. 

Figure  4  5  show  the  results  of  the  simulation  of  Nr  = 


Nr  =  6  system. 


Fig.  4.  Average  Channel  Capacity  for  Nt  —  Nr  =  6  in  Rician  fading 
MIMO  system  with  K  =  10. 


Fig.  5.  Average  Channel  Capacity  for  Nt  =  Nr  —  6  in  Rayleigh 
fading  MIMO  system. 

We  notice  that  the  average  channel  capacities  of  equal 
power  allocation  strategy  are  larger  than  the  ones  of 
waterfilling  strategy  at  high  SNR.  When  we  perform  the 
waterfilling  scheme,  some  of  the  decomposed  parrallel 
channels  may  not  be  assigned  any  power,  so  that  some 
of  the  diversity  gain  is  lost. 

We  compare  the  average  channel  capacities  of  Nt  = 
Nr  —  3  and  Nt  =  Nr  =  6  MIMO  systems  at 
Figure  6.  The  Rician  fading  parameters  K  and  frj  for 
each  independent  sub-channel  are  chosen  from  0  ~  10 
and  0  ~  30 Hz  respectively. 

Without  any  doubt,  the  average  channel  capacity  of  the 


Nt  =  Nr  =  6  MIMO  system  is  much  larger  than  the  one 
of  the  Nt  =  Nr  =  3  MIMO  system. 


Fig.  6.  Average  Channel  Capacities  for  Nt  =  Nr  =  3  and  Nt  = 
Nr  =  6  MIMO  systems. 


Figure  7  shows  the  average  channel  capacities  get  from 
Nr  =  3,  Nt  =  6  and  Nr  =  6,  NT  =  3  MIMO  systems. 


SNR(dB) 


Fig.  7.  Average  Channel  Capacities  for  Nr  =  3,  Nt  =  6  and  Nr  = 
6,  Nt  =  3  MIMO  systems. 

For  equal  power  allocation  algorithm,  the  channel  capac¬ 
ities  for  the  two  systems  are  exactly  the  same.  However, 
for  waterfilling  strategy,  more  gains  can  be  obtained  from 
the  receiving  part. 

V.  Conclusions  and  Future  Works 
A.  Conclusions 

Virtual  MIMO  structure  is  very  attractive  to  WSN  due  to 
its  potential  huge  diversity  gain.  WSN  is  power  constraint, 


however,  with  CSIT,  waterfilling  algorithm  can  be  applied 
to  optimize  the  power  allocation  in  each  sub-channel  of 
the  MIMO  system  based  on  the  estimated  channel  matrix 
H,  so  as  to  maximize  the  channel  capacity  of  the  system 
even  in  deep  fading  scenario.  However,  in  reality,  both 
the  estimation  error  and  the  quantization  noise  will  be 
introduced  when  estimate  the  channel. 

In  this  paper,  we  first  discuss  the  virtual  MIMO  structure 
in  WSN,  then  set  up  the  MIMO  system  model  focused 
on  one-hop  intercluster  transmission.  Afterwards,  the  error 
model  of  the  channel  estimation  is  investigated,  and  how 
the  estimation  error  will  impact  the  optimal  waterfilling 
strategy  is  analyzed.  Numerical  results  were  presented 
based  on  the  Monte-Carlo  simulations. 

B.  Future  work 

•  In  WSN,  transmission  are  generally  multi-hopped,  as 
shown  in  section  II-A.  In  this  paper,  we  only  focused 
on  the  one-hop  channel  capacity,  however,  the  overall 
capacity  including  relaying  hops  should  be  derived. 

•  We  only  consider  the  estimation  error  in  this  paper. 
Delay  of  the  estimation,  i.e.,  H(f  -  T)  j=-  H(f),  and 
only  H (t  —  T)  is  available,  can  also  impact  the  system 
performance. 

•  Wireless  sensor  nodes  are  usually  small-sized  with 
limited  computing  capability.  Waterfilling  algorithm 
is  based  on  the  singular  values  of  MIMO  channel 
matrix,  which  needs  considerable  computing.  Besides 
the  overhead  to  the  transmitting  sensor  nodes  is  also 
not  negligible.  Simple  but  efficient  power  allocation 
algorithms  should  be  developed. 
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Abstract — The  multiple-input  and  multiple- 

output(MIMO)  system  can  be  used  to  increase  throughtput 
throught  multiplexing  or  to  improve  PLD(Packet  Loss 
Ration)  throught  diversity.  However,  the  throughtput  and 
PLR  will  also  be  determined  by  MAC  layer  and  network 
layer  protocols.  In  this  paper,  we  coordinate  physical 
layer,  data-link  layer  and  network  layer  for  cross-layer 
design.  Performance  analysis  and  simulations  show  that 
throughput  and  packet  performance  will  have  different 
performance  compared  with  only  consider  the  MIMO 
scheme  in  physical  layer  as  the  increase  of  the  number  of 
transmitters. 

I.  Introduction 

The  demand  for  energy  efficiency  and  Quality  of 
Service  (QoS)  in  wireless  sensor  networks  is  growing 
in  a  rapid  speed.  A  strict  layered  design  is  not  flexible 
enough  to  cope  with  the  dynamics  of  the  wireless  sensor 
networks  [1],  To  enhance  the  energy  efficiency  and  QoS, 
we  consider  the  combination  of  physical  layer,  data-link 
layer  and  network  layer  together,  a  cross-layer  approach. 
Cross-layer  design  could  introduce  the  layer  interdepen¬ 
dencies  to  optimize  overall  network  performance. 

The  general  methodology  of  cross-layer  design  is  to 
maintain  the  layered  architecture,  capture  the  impor¬ 
tant  information  that  influence  other  layers,  exchange 
the  information  between  layers  and  implement  adaptive 
protocols  and  algorithms  at  each  layer  to  optimize  the 
performance. 

However,  cross-layer  design  can  produce  unintended 
interactions  among  protocols,  such  as  an  adaptation 
loops.  It  is  hard  to  characterize  the  interaction  at  different 
layers  and  joint  optimization  across  layers  may  lead  to 
complex  algorithm. 

MIMO  systems  can  support  higher  data  rate  under 
the  same  transmit  power  budget  and  bit-error-rate  perfor¬ 
mance  requirement  as  single-input-single-output  (SISO) 


systems.  However,  direct  application  of  multi-antenna 
techniques  to  sensor  networks  is  impractical  duo  to  the 
limited  physical  size  of  a  sensor  node  that  is  typically 
can  only  support  a  single  antenna.  Fortunately,  if  we 
allow  individual  single-antenna  nodes  to  cooperate  on 
information  transmission  and/or  reception,  a  cooperative 
MIMO  system  can  be  constructed  such  that  energy 
efficient  MIMO  scheme  can  be  deployed  [2]. 

Using  cooperative  MIMO,  we  show  that  the  end-to- 
end  performance  can  be  dramatically  improved.  More¬ 
over,  the  novel  approach  of  distributed  Alamouti  [3] 
coding  provides  diversity  gain  with  no  local  information 
exchange,  as  is  typically  required  in  node  cooperation. 

However,  the  routing  and  MAC  layer  protocols  have 
different  effects  on  network  performance  compared  with 
cooperative  MIMO  technique.  Performance  analysis  and 
simulation  results  have  been  illustrate  this  situation 

The  remainder  of  this  paper  is  structured  as  following. 
In  section  II,  we  introduce  the  preliminaries.  In  section 
III,  we  make  the  performance  analysis  for  all  the  layers. 
In  section  IV,  we  make  the  performance  analysis  for 
cross-layer  model.  Simulation  results  and  discussions  are 
presented  in  section  V.  In  section  VI,  we  conclude  the 
paper. 

II.  Preliminaries 
A.  IEEE  802.11a  OFDM  PHY 

The  physical  layer  is  the  interface  between  the  wire¬ 
less  medium  and  the  MAC  [4],  The  principle  of  OFDM 
is  to  divide  a  high-speed  binary  signal  to  be  transmitted 
over  a  number  of  low  data-rate  subcarriers.  A  key 
feature  of  the  IEEE  802.11a  PHY  is  to  provide  8  PHY 
modes  with  different  modulation  schemes  and  coding 
rates,  making  the  idea  of  link  adaptation  feasible  and 
important.  BPSK,  QPSK,  16-QAM  and  64-QAM  are  the 
supported  modulation  schemes.  The  OFDM  provides  a 


data  transmission  rates  from  6  to  54MBPS.  The  higher 
code  rate  of  2/3  and  3/4  are  obtained  by  puncturing  the 
original  rate  1/2  code. 

B.  IEEE  802.11  MAC 

The  802.1 1  MAC  uses  Carrier-Sense  Multiple  Access 
with  Collision  Avoidance  (CSMA/CA)  to  achieve  au¬ 
tomatic  medium  sharing  between  compatible  stations. 
In  CSMA/CA,  a  station  senses  the  wireless  medium  to 
determine  if  it  is  idle  before  it  starts  transmission.  If 
the  medium  appears  to  be  idle,  the  transmission  may 
proceed,  else  the  station  will  wait  until  the  end  of 
the  in-progress  transmission.  A  station  will  ensure  that 
the  medium  has  been  idle  for  the  specified  inter-frame 
interval  before  attempting  to  transmit. 

Besides  carrier  sense  and  RTS/CTS  mechanism,  an 
acknowledgment  (ACK)  frame  will  be  sent  by  the  re¬ 
ceiver  upon  successful  reception  of  a  data  frame.  Only 
after  receiving  an  ACK  frame  correctly,  the  transmitter 
assumes  successful  delivery  of  the  corresponding  data 
frame.  The  sequence  for  a  data  transmission  is:  RTS- 
CTS-DATA-ACK. 

A  mobile  node  will  retransmit  the  data  packet  when 
finding  failing  transmission.  Retransmission  of  a  signal 
packet  can  achieve  a  certain  probability  of  delivery. 
There  is  a  relationship  between  the  probability  of  de¬ 
livery  p  and  retransmission  times  n  [5]: 

n  =  1.45  In  — - —  (1) 

1  -p 

The  IEEE  802.11  standard  requires  that  the  transmit¬ 
ter’s  MAC  discard  a  data  frame  after  certain  number 
of  unsuccessful  transmission  attempts.  According  to  the 
requirement  of  probability  of  delivery,  we  choose  the 
minimum  number  of  retransmission.  The  advantage  is 
we  can  save  energy  through  avoiding  unnecessary  re¬ 
transmission,  and  ensure  probability  of  delivery. 

C.  Network  Layer 

The  Ad  hoc  On  Demand  Distance  Vector  (AODV) 
routing  algorithm  is  a  routing  protocol  designed  for  ad 
hoc  mobile  networks  [6].  AODV  is  capable  of  both  uni¬ 
cast  and  multicast  routing.  It  is  an  on  demand  algorithm, 
meaning  that  it  builds  routes  between  nodes  only  as 
desired  by  source  nodes.  It  maintains  these  routes  as  long 
as  they  are  needed  by  the  sources.  Additionally,  AODV 
forms  trees  which  connect  multicast  group  members.  The 
trees  are  composed  of  the  group  members  and  the  nodes 
needed  to  connect  the  members.  AODV  uses  sequence 


numbers  to  ensure  the  freshness  of  routes.  It  is  loop- 
free,  self-starting,  and  scales  to  large  numbers  of  mobile 
nodes. 

AODV  builds  routes  using  a  route  request/route  reply 
query  cycle.  As  long  as  the  route  remains  active,  it  will 
continue  to  be  maintained.  A  route  is  considered  active 
as  long  as  there  are  data  packets  periodically  travelling 
from  the  source  to  the  destination  along  that  path.  Once 
the  source  stops  sending  data  packets,  the  links  will  time 
out  and  eventually  be  deleted  from  the  intermediate  node 
routing  tables.  If  a  link  break  occurs  while  the  route  is 
active,  the  node  upstream  of  the  break  propagates  a  route 
error  message  to  the  source  node  to  inform  it  of  the  now 
unreachable  destinations. 

Multicast  routes  are  set  up  in  a  similar  manner.  AODV 
maintains  routes  for  as  long  as  the  route  is  active.  This 
includes  maintaining  a  multicast  tree  for  the  life  of  the 
multicast  group.  Because  the  network  nodes  are  mobile, 
it  is  likely  that  many  link  breakages  along  a  route  will 
occur  during  the  lifetime  of  that  route. 

D.  Billiard  Mobility 

Pick  a  uniform  random  direction,  advance  in  that 
direction.  A  new  random  direction  is  selected  every  T 
seconds,  where  T  is  exponentially  distributed.  Bound¬ 
aries  are  reflecting,  i.e.  a  node  bounces  off  as  off  of  a 
mirror. 

E.  Node  Mobility  and  Channel  Fading 

Mobility  of  a  mobile  node  generates  a  doppler  shift, 
which  is  a  key  parameter  of  fading  channel.  The  doppler 
shift  is 

fd  = -fc  (2) 

c 

where  v  is  the  ground  speed  of  a  mobile  node,  c  is  the 
speed  of  light  (3  x  108m/s),  and  fc  is  the  carrier.  In  our 
simulation,  we  used  the  carrier  is  6GHz,  For  reference, 
if  a  node  moves  with  speed  10 m/s,  the  doppler  shift  is 
200 Hz. 

We  model  channel  fading  in  wireless  sensor  networks 
as  Rayleigh  fading.  Rayleigh  fading  occurs  when  there  is 
a  strong  specular  (direct  path  or  line  of  sight  component) 
signal  in  addition  to  the  scatter  (multipath)  components. 
For  example,  in  communication  between  two  infrared 
nodes,  there  exist  a  direct  path.  The  channel  gain, 

9(t)  =9i(t)  +j9Q{t)  (3) 

can  be  treated  as  a  wide-sense  stationary  complex  Gaus¬ 
sian  random  process,  and  gj(t )  and  gQ(t)  are  Gaussian 
random  processes  with  zero  means;  and  they  have  same 


variance  a1,  then  the  magnitude  of  the  received  complex 
envelop  has  a  Rayleigh  distribution, 


x  a: 

Pa(x)  =  -2  exp{-— 2}  x>0 


This  kind  of  channel  is  known  as  Rayleigh  fad¬ 
ing  channel.  A  Rayleigh  channel  is  characterized  by 
parameter,  the  Doppler  spread  (or  single-sided  fading 
bandwidth)  fd-  The  Rayleigh  fade  generator  is  based  on 
Jakes’  model  [7]  in  which  an  ensemble  of  sinusoidal 
waveforms  are  added  together  to  simulate  the  coherent 
sum  of  scattered  rays  with  Doppler  spread  fd  arriving 
from  different  directions  to  the  receiver. 

For  the  values  of  M  for  MPSK(Mary  phase  shift 
keying),  one  can  use  the  approximate  BER  expression 
obtained  by  Lu  et  al  [8].  For  the  AWGN(Additive  white 
Gaussian  niose),  which  is  accurate  for  a  wide  range 
of  SNRs,  again  making  the  substitution  -ylog2M  for 
Eh/N0  followed  by  averaging  over  the  PDF  of  7.  Using 
the  alternative  form  of  the  Gaussian  Q-function,  it  is 
straightforward  to  show  that  the  result  of  the  evaluation 
is  given  by: 


ea  _ 2 _ s-'max(Mi  d)  j_ 

1  b\&)  -  max(log2  A/,2)  2->i- 1  it  * 
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where  M7(s)  is  the  MGF(Moment  generating  func- 
tion)of  the  instantneous  fading  power  7.  For  a  Rayleigh 
fading  channel,  we  obtain  the  following  analogous  to: 


Pb(E) 


'  1  Y^max(  ^  ,1) 

max( Iog2  M} 2)  2^i= 1 
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For  BPSK,  we  get  the  result: 


Pb(E)  =  -(1 


III.  Performance  Analysis 


A.  MIMO 


It  is  wildly  understood  that  in  a  system  with  multiple 
transmit  and  receive  antennas,  the  spectral  efficiency  is 
much  higher  than  that  of  the  conventional  single-antenna 
channels,  i.e.,  a  MIMO  system  can  provide  two  types  of 
gains:  diversity  gain  and  multiplexing  gain. 


1)  MIMO  Diversity  Gain:  The  multiple  antennas  at 
the  transmitter  and  receiver  can  be  used  to  obtain  di¬ 
versity  gain:  This  scheme  is  also  referred  to  as  MIMO 
beamforming  [9]  [10].  Beamforming  provides  diversity 
gain  via  coherent  combining  of  the  multiple  signal  paths. 
Channel  knowledge  at  the  receiver  is  assumed,  since 
this  is  required  for  coherent  combining.  A  beamforming 
strategy  corresponds  to  the  precoding  and  shaping  ma¬ 
trices  being  just  column  vectors:  V=v  and  U  =  u.The 
transmit  symbol  x  is  sent  over  the  ith  antenna  with 
weight  Vi.  On  the  receive  side,  the  signal  received  on 
the  ith  antenna  is  weighted  by  u*.  The  resulting  received 
signal  is  given  by: 


y  =  uH  Hvx  +  uHn 


The  performance  gain  then  depends  on  whether  or  not 
the  channel  is  known  at  the  transmitter.  When  the  chan¬ 
nel  matrix  H  is  known,  the  received  SNR  is  optimized  by 
choosing  u  and  v  as  the  principal  left  and  right  vectors 
of  the  channel  matrix  H.  That  is,  the  reveived  SNR  can 
be  shown  to  equal  7  =  cr^aa.p,  where  amax  is  the  largest 
singular  value  of  H. 

When  the  channel  is  not  known  to  the  transmitter,  for 
Mt- 2  the  Alamouti  scheme  described  in  [3]  can  be  used 
to  extract  an  array  gain  of  Mr  and  the  maximum  diversity 
gain  of  2 Mr. 

Two-Branch  Transmit  Diversity  with  one  receiver  the 
scheme  uses  two  transmit  antennas  and  one  receive 
antenna  and  may  be  defined  by  the  following  three 
functioin:  the  ending  and  transmission  sequence  of  infor¬ 
mation  symbols  at  the  transmitter;  the  combining  scheme 
at  the  receiver;  the  decision  rule  for  maximum  likelihood 
detection. 

The  encoding  and  transmission  sequence:  at  a  given 
symbole  period,  two  signals  are  simultaneously  tans- 
mitted  from  the  two  antennas.  The  signal  transmitted 
from  antenna  zero  is  denoted  by  ci  and  from  antenna 
one  by  c2.  During  the  next  symbol  period  signal  (- 
cJj)  is  transmiited  from  antenna  zero,  and  signal  c\  is 
transmitted  from  antenna  one  where  *  is  the  complex 
conjugate  operation. 

r\  =  h\c\  +  h2C2  +  ni  (9) 

r2  =  —h\C^  +  h2c\  +  n2  (10) 

where  r\  and  r2  are  the  received  signals  at  time  t 
and  t+T  and  n\  and  n2  are  complex  random  variables 
representing  receiver  noise  and  interference. 


(O.M'M,) 


We  will  build  the  following  two  combined  signals  that 
are  sent  to  the  maximum  likelihood  detector: 

c\  =  (a?  +  a|)ci  +  h\n  i  +  hin\  (1 1) 

C2  =  (<*1  +  ct2)c2  +  h\nl  +  h\n\  (12) 


The  combined  signals  are  then  sent  to  the  maximum 
likelihood  detector,  uses  the  decision  rule  to  decide  the 
signals  c\  or  c 2. 

It  is  further  shown  that  the  scheme  may  easily  be  gen¬ 
eralize  to  M  transmit  antennas  and  one  receive  antenna 
to  provide  diversity  gain  M. 

2)  MIMO  Multiplexing  Gain:  One  mechanism  for 
utilizing  to  improve  wireless  system  performance  is  to 
obtain  gain  by  decomposing  the  MIMO  channel  into 
parallel  channels  and  multiplexing  different  data  streams 
onto  these  channels.  By  multiplexing  independent  data 
onto  these  independent  channels,  we  get  an  increase  in 
data  rate  in  comparison  to  a  system  with  just  one  antenna 
at  the  transmitter  and  receiver.  This  increased  data  rate 
is  called  multiplexing  gain. 

3)  MIMO  Diversity-Multiplexing  Gain  Tradeoffs:  The 
MIMO  diversity-multiplexing  gain  tradeoffs  is  essen¬ 
tially  the  tradeoff  between  the  error  probability  and  the 
data  rate  of  a  system. 

A.  Optimal  Tradeoff  Curve 

A  schemec(SNR)is  said  to  achieve  spatial  multiplex¬ 
ing  gain  r  and  diversity  gain  d  if  the  data  rate  [11]: 
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For  each  r,  the  optimal  diversity  gain  dopt(r )  is  maxi¬ 
mum  diversity  gain  that  can  be  achieved  by  any  scheme. 


dopt{r)  =  (Mt  -  r)(Mr  -  r )  (15) 


0  <  r  <  Mr )  (16) 

Equation(15)(16)  are  plotted  in  Figure  1. 

This  figure  implies  if  we  use  all  transmit  and  receive 
antennas  for  diversity  then  we  get  full  diversity  gain 
MtMr  and  we  can  use  some  of  these  antennas  to  increase 
data  rate  at  the  expense  of  diversity  gain. 

B. Outage  Formulation 

Channel  outage  is  usually  discussed  for  nonergodic 
fading  channels,  i.e.,  the  channel  matrix  H  is  choosen 


Multiplexing  Gain  i-R’Iog.;SNR) 


Fig.  1 .  Diversity-multiplexing  trade-offs 


randomly  but  is  held  fixed  for  all  time.  This  nonergodic 
channel  can  be  written  as: 

I  SNR  TT 

Vt  =  \ - Hxt  +  wt  (17) 

V  m 

where  xt  £  Cm,  yt  £  Cn  are  the  transmitted  and 
received  signal  at  time  t,  and  wt  £  Cn  is  the  additive 
Gaussian  noise.  An  outage  is  defined  as  the  event  that 
the  mutual  information  of  this  channel  does  not  support 
a  target  data  rate: 

H  :  I{xf,yt\H  =  H)  <  R  (18) 

The  mutual  information  is  function  of  the  input  dis¬ 
tribution  P(xt)  and  the  channel  realization.  Without  loss 
of  optimality,  the  input  distribution  can  be  taken  to  be 
Gaussian  with  a  covariance  matrix  Q,  in  which  case 

C  AT 

I(xt;yt\H  =  H)  =  logdet{I  +  —r-—HQH+)  (19) 

Mf 

Optimzing  over  all  input  distributions,  and  on  the  scale 
of  interest,  the  bounds  are  tight,  we  have 

Pout(R)  A  P[logdet(I  +  SNRHH+)  <  R]  (20) 

For  the  multiple-antenna  channel,  let  the  data  rate  be 
R=  rlog  SNR,  with  r  <  mirn(Mt,  Mr).  The  outage 
probability  satisfies 

Pout(r  log  SNR)  A  SNR~d°'“ <r>  (21) 


where 


roin(Me,Mr) 

dout(r)  =  inf  Y'  2i  -  1  +  \Mt  -  Mr|a,  (22) 

aeA  ^ J 
i=l 


and 

oti  =  —  log  pi/  log  SNR  (23) 


B.  IEEE  802.11  MAC  Protocol 

pc  is  the  probability  of  a  collision  seen  by  a  packet 
being  transmitted  on  the  medium  [12]  [13]  [14],  Assum¬ 
ing  that  there  are  n  stations  in  the  wireless  LAN  we  are 
considering,  we  observe  that  pc  is  also  the  probability 
that  there  is  at  least  one  packet  transmission  in  the 
medium  among  other  (n-1)  stations  in  the  interference 
range  of  the  station  under  consideration.  This  yields: 

Pc  =  1  -  [1  -  (1  -  Po)t]"~1  (24) 


The  average  queue  length,  blocking  probability,  and 
average  waiting  time  including  MAC  sevice  time  are 
given  by, 

K 

L  =  Y^i*Pi  (28) 

i=0 


PB 


1  - 


1 

7T0  +  p 
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where  po  is  the  probability  that  there  is  no  packet 
ready  to  transmit  at  the  MAC  layer  in  wireless  station  un¬ 
der  consideration,  and  r  is  the  packet  transmission  proba¬ 
bility  that  the  station  transmits  in  a  randomly  chosen  slot 
time  given  that  the  station  has  packets  to  transmit  at  the 
MAC  layer.  In  non-saturated  scenario,  pc  mainly  depends 
on  the  total  number  of  packets  attempting  to  transmit  by 
all  neighbouring  stations.  However,  in  saturated  scenario, 
i.e.  the  stations  always  have  packets  to  transmit,  the  total 
number  of  packets  attempting  to  transmit  equals  to  the 
total  number  of  neighboring  stations,  hence  pc  is  mainly 
dependent  on  the  total  number  of  neighboring  stations. 

A  queue  model  can  be  characterized  by  the  service 
time  distribution  and  the  arrival  process  in  addition  to  the 
service  dicipline.  In  this  paper,  we  assume  that  the  packet 
arrivals  at  each  mobile  station  follow  the  Poisson  process 
or  a  deterministic  distribution  with  average  arrival  rate 
A.  The  packet  transmission  process  at  each  station  can 
be  modeled  as  a  general  single  server.  The  buffer  size 
at  each  station  is  K.  Thus  the  queueing  model  for  each 
station  can  be  modeled  as  an  M/G/l/K  when  Poisson 
arrivals  of  packets  are  assumed. 

Let  pn  represent  the  steady-state  probability  of  n 
packets  in  the  queueing  system,  and  let  7r„  represent  the 
probability  of  n  packets  in  the  queueing  system  upon 
a  departure  at  the  steady  state,  A'  is  the  average  arrival 
rate,  Ts  is  the  duration  of  time  taken  for  a  state  transition 
from  the  start  state  (  beginning  to  be  served)  to  the  end 
state  (being  transmitted  successfully  or  discarded  after 
maximum  a  times  retranmission  failures),  p  is  the  traffic 
internsity  p  =  A 'Ts.  For  the  finite  system  size  K  with 
Poisson  input  [15],  we  have 


Po 
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7T0  +  p 
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If  we  know  the  blocking  probability  pB,  then  the 
throughput  S  can  be  computed  easily  by: 

S  =  (l-PB)(1-Pac+1)  (31) 

where  a  is  the  maximum  retransmission  times,  p"+1  is 
the  packet  discard  probability  due  to  tranmission  failure. 

Suppose  a  is  constant,  as  n  increase,  pc  increase,  the 
he  packet  discarding  probability  at  MAC  layer  increase. 

Throughput  linearly  increases  with  the  offered  load  at 
the  non-saturated  status  and  maintains  a  constant  value 
with  different  total  number  of  transmitting  stations  at  the 
saturated  status.  As  the  n  increases,  the  constant  value 
decrease. 


C.  AODV  Routing  Protocol 

Using  the  Opnet  and  NS-2,  we  could  conduct  an 
extensive  set  of  performance  experiments  for  the  AODV 
routing  protocol  [16]  [17]  [18]. 

1)  With  the  increase  of  number  of  hops,  throughput 
degrades  due  to  the  higher  delay. 

2)  With  the  increase  of  loads  (i.e.  application  traffic), 
throughput  can  again  be  degraded  due  to  the  loss 
at  the  link  layer.  Link  layer  losses  could  be  due  to 
problems  of  hidden/exposed  node  or  collisions  in 
the  wireless  media. 

3)  the  connectivity  between  nodes  decreases, 
throughput  also  decrease. 

4)  the  mobility  increase,  throughput  decrease. 

IV.  Cross-layer  Analysis 

When  we  combine  the  Physical  layer,  MAC  layer 
and  network  layer,  we  need  to  analyze  two  performance 
parameters:  throughput  and  packet  loss  ratio(PLR). 


1)  Throughput:  From  the  performance  analysis  of  the 
physical  layer,  MIMO  scheme  could  achieve  multiplex¬ 
ing  gain  to  increase  the  throughput  as  the  number  of  Tx 
increases. 

For  the  CSMA/CA  MAC  protocol,  in  non-saturated 
scenario,  throughput  linearly  increases  with  the  offered 
load,  i.e.,  the  throughtput  will  increase,  as  the  number  of 
Tx  increase.  As  the  Tx  increases,  it  will  be  the  saturated 
scenario.  The  throughput  will  decrease  as  the  number  of 
Tx  increases. 

For  AODV  routing  protocol,  if  we  increase  the  number 
of  Tx,  we  need  more  routing  paths,  the  connectivity 
between  nodes  dreases,  throughtput  decrease. 

2)  Packet  Loss  Ratio:  MIMO  scheme  could  achieve 
diversity  gain  to  increase  the  performance  as  the  number 
of  Tx  increases.  So  the  BER  will  decrease.  If  each  bit 
inside  the  L  length  packet  has  the  same  BER  and  bit¬ 
errors  are  uncorrelated,  the  PER  can  be  related  to  the 
BER  through  [19]: 

PER  =  1  -  (1  -  BER)l  (32) 

We  could  conclude  that  for  MIMO  scheme,  the  PER  will 
decrease  as  the  number  of  Tx  increase. 

For  CSMA/CA  MAC  protocol,  as  Tx  increases,  the  to¬ 
tal  number  of  packets  attempte  to  transmit  will  increase, 
the  collision  probability  will  increase,  suppose  block 
probability  is  given,  the  maximum  retransmission  times 
is  constant,  the  packet  discard  probability  will  increase 
shown  in  equation  (31). 

In  AODV  routing  protocol,  the  node  density  keeps 
unchanged  as  the  increase  of  the  number  of  the  trans¬ 
mitters.  It  is  hard  to  establish  enough  routing  paths.  The 
buffer  size  in  each  sensor  node  is  constant,  the  arriving 
packets  rate  is  constant,  more  packets  will  be  discarded 
when  the  buffer  is  full. 

V.  Simulations 


6 


Fig.  2.  System  Architecture  for  cross-layer  design 

Figure  2  illustrates  the  system  architecture  for  cross¬ 
layer  design.  Only  the  source  sensor  nodes  send  out  the 


packets.  The  source  sensor  nodes  need  to  set  up  paths 
to  the  destination  sensor  node.  There  are  three  phases 
during  the  packets  transmissions. 

1)  MIMO  scheme  for  physical  layer. 

2)  set  up  the  link  in  MAC  layer. 

3)  routing  path  discovery  in  network  layer. 

We  use  Matlab  to  obtain  the  BER-SNR  curves  for 
MIMO  schemes.  We  run  1000  Monte-Carlo  simulations 
to  get  the  physical  layer  curves. 

We  implemented  the  cross-layer  model  using  the 
OPNET  modeler.  The  simulation  region  is  1000x1000 
meters.  The  wireless  communcation  range  is  300  meters. 
There  were  49  mobile  nodes  in  the  simulation  model, 
and  the  nodes  were  roaming  independently  with  variable 
ground  speed  between  0  to  10  meters  per  second.  The 
mobility  model  was  called  billiard  mobility  model,  the 
maximum  retransmission  times  is  7  and  arriving  packet 
distribution  is  Poisson.  The  modulation  scheme  is  BPSK. 

3)  Throughput:  Figure  3  is  the  throughput  perfor¬ 
mance  for  the  cross-layer  model.  It  shows  when  the 
throught  is  maximum  when  the  number  the  transmitter 
is  11.  As  the  number  of  Tx  increases,  throughput  will 
increase  for  the  MIMO  scheme.  For  the  MAC  layer, 
when  Tx  is  small,  it  is  the  non-saturated  scenario.  As 
the  Tx  increase,  throughput  will  increase.  When  Tx  is 
large  enough,  it  is  the  saturated  scenario.  As  the  Tx 
increases,  throughput  will  decrease.  For  AODV  protocol, 
the  collision  probability  will  increase  as  the  the  Tx 
increases.  The  simulation  result  is  consented  with  the 
analysis  result. 


Fig.  3.  Throughput  Vs  Number  of  Transmitters 


4)  Packet  Loss  Ratio:  Figure  4  is  the  packet  loss  rate 
performance  for  the  cross-layer  model.  The  change  of 
curve  in  fig.4  is  instability,  but  the  best  performance  for 
PLR  was  achieved  when  the  number  of  transmitter  is  10. 
As  the  Tx  increases,  BER  decreases,  so  PER  decreases, 


more  packets  will  be  discarded,  PLR  decreases.  For 
MAC  layer  protocol,  when  Tx  increases,  the  probability 
of  collision  will  increase.  The  packet  loss  ratio  will 
keep  stable  for  retransmission  could  solve  the  collision 
problem.  If  the  number  of  Tx  is  big  enough,  retransmis¬ 
sion  number  will  exceed  the  maximum  retransmission 
number  a,  large  number  of  prackets  will  be  discarded, 
the  PLR  will  increase  sharply.  For  AODV  protocol,  as 
the  increase  of  the  Tx,  it  is  hard  ro  set  up  enough 
routing  pathes.  Large  number  of  packets  will  jam  in 
the  finite-size  buffer,  more  packets  will  be  discarded. 
The  simulation  result  matched  well  with  the  performance 
analysis. 


Fig.  4.  PLR  Vs  Number  of  Transmitters 


VI.  Conclusion 

MIMO  scheme  is  a  effective  method  to  improve  the 
performance  of  the  wireless  sensor  networks.  We  apply 
the  cross-layer  design  to  combine  physical  layer,  data- 
link  layer  and  network  layer  together.  For  the  data-link 
and  network  layer  protocolas  have  different  effect  on  the 
WSN  performance.  Throughput  and  packet  performance 
will  be  have  different  performance  compared  with  only 
consider  the  MIMO  scheme  in  physical  layer  as  the 
increase  of  the  number  of  transmitters. 

Acknowledgment 

This  work  was  supported  by  the  U.S.  Office  of  Naval 
Research  (ONR)  Young  Investigator  Award  under  Grant 
N000 1 4-03- 1  -0466. 


[2]  Cui,  S.;  Goldsmith  A.J.;  “  Cross-Layer  Optimization  of  Sensor 
Networks  based  on  Cooperative  MIMO  Techniques  with  Rate 
Adaptation,”  Proceedings:  IEEE  workshop  on  Signal  Processing 
Advances  in  Wireless  Communications  (SPAWC),  New  York,  NY, 
pp.  960-964,  Jun  2005. 

[3]  Almouti,  S.;  “A  simple  Transmit  Diversity  Technique  for 
Wireless  Communications”,  IEEE  Journal  on  Select  Area  In 
Communication,  Vol.  16,  No.  8,  October  1998  . 

[4]  D.  Qiao,  S.  Choi,  and  K.  G.  Shin,  “  Goodput  Analysis  and  Link 
Adaption  for  IEEE  802.1  la  Wireless  LANs”,  IEEE  Transactions 
On  Mobile  Computing ,  Oct.  2002. 

[5]  Bao,  L.H.;  Garcia-Luna-Aceves,  J.J.;  “  Hybrid  Channel  Access 
Scheduling  in  Ad  Hoc  Networks”,  IEEE  Computer  Society, 
Washington,  DC,  USA. 

[6]  Chakeres,  I.  etc;  “The  Ad  hoc  On  Demand  Distance  Vector 
(AODV)  routing  algorithm”,  University  of  California  at  Santa 
Barbara  ,  Santa  Barbara  ,  CA,  USA. 

[7]  G.L.  Stuber;  “Principles  of  Mobile  Communcation”  Kluwer 
Academic  Press,  2001. 

[8]  k.  Simon,  et  al,  "Digital  Communication  over  Fading  Channels," 
Wiley  Interscience,  2005. 

[9]  Tarokh,  V.;  Jafarkhani  H.;  Calderbank  A.  R.;  “Space-Time  Block 
Coding  for  Wireless  Communications:  Performance  Result”, 
IEEE  Journal  on  Selected  Areas  in  Communications,  Vol.  17, 
No.3,  March  1999. 

[10]  Goldsmith,  A.;  “Wireless  Communication”,  Cambridge  Uni¬ 
versity,  2005. 

[11]  Zheng,  L.  and  Tse,  D.N.C.;  “  Diversity  and  Multiplexing: 
A  Fundamental  Tadeoff  in  Multiple-Antenna  Channels,”  IEEE 
Transaction  On  Information  Theory ,  Vol.  49  ,  No.  5  ,  May  2003. 

[12]  Bianchi,  G.;  “Performance  Analysis  of  IEEE  802.1 1  Distributed 
Coordination  Function”,  IEEE  Journal  on  Selected  in  Commu¬ 
nications.  Vol.  18,  No.  3,  March  2000. 

[13]  Zeng,  G.;  Chlamtac  I.;  “A  Finite  Queueing  Model  For  IEEE 
802.11  MAC  Protocol”,  IEEE  59th  Vehicular  Technology  Con¬ 
ference,  VTC  2004-Spring,  Vol  4,  17-19  May  pp.  2037  -  2041. 

[14]  Zhai,  H.;  Fang  Y.;  “Performance  of  Wireless  LANs  Based  on 
IEEE  802.11  MAC  Protocols”,  1 4th  IEEE  International  Sym¬ 
posium  on  Personal,  Indoor  and  Mobile  Radio  Communications 
(PIMRC’03),  Beijing,  China,  September,  2003. 

[15]  Gross, D  and  Harris,  C.  M.;  “Fundamental  of  Queueing  Theory”, 
3rd  ed.,  John  Wiley  &  Sons,  Inc,  1998. 

[16]  Ari,  I.;  Jethani,  N.;  Rangnekar,  A.;  Natarajan,  S.;  “  Perfor¬ 
mance  Analysis  and  Comparision  of  Ad-Hoc  Routing  Protocols,” 
UMBC ,  CMSC  691T, Mobile  Computing,  Project  Report. 

[17]  Gerasimov,  I.;  Simon,  R.;  “  Performance  Analysis  for  Ad  Hoc 
QoS  Routing  Protocols,”  International  Mobility  and  Wireless 
Access  Workshop  (MobiWac'02),  2002. 

[18]  Kuladinithi,  K.;  Gorg,  R.;  “  Performance  Analysis  of  Ad 

hoc  On-demand  Distance  Vector  routing  (AODV)  using  OPNET 
Simulator,”  University  of  Bremen,  Communication  Networks, 
Project  Report. 

[19]  Liu,  Q.,  Zhou,  S.  and  Giannakis,  G.;  “  Cross-Layer  Combining 
of  Adaptive  Modulation  and  Coding  with  Truncated  ARQ  over 
Wireless  Links  ,”  IEEE  Transactions  on  Wireless  Communica¬ 
tions,  Volume:  3  ,  Issue:  5  ,  Sept.  2004  pp.  1746  -  1755. 


References 

[1]  Goldsmith  A.J.;  Wicker,  S.B.;  “  Design  Challenges  for  Energy- 
Constrained  Ad  Hoc  Wireless  Networks,”  IEEE  Wireless  Comm.,, 
vol.  9,  No.  4  ,2002  ,  pp.  8-27. 


1 

Hop-Distance  Estimation  in  Wireless  Sensor 
Networks  with  Applications  to  Resources 

Allocation 

Liang  Zhao  and  Qilian  Liang 
Department  of  Electrical  Engineering 
University  of  Texas  at  Arlington 
Arlington,  TX  76010,  USA 
Email:  zhao@wcn.uta.edu,  liang@uta.edu 

Abstract 

In  this  paper,  we  address  a  fundamental  problem  in  Wireless  Sensor  Networks,  how  many 
hops  does  it  take  for  a  packet  to  be  relayed  for  a  given  distance?  For  a  deterministic  topology, 
this  question  reduces  to  a  simple  geometry  problem.  However,  a  statistical  study  is  needed  for 
randomly  deployed  WSNs.  We  propose  a  Maximum  Likelihood  decision  based  on  the  conditional 
pdf  of  f(r\Hi).  Due  to  the  computational  complexity  of  we  also  propose  an  attenuated 

Gaussian  approximation  for  the  conditional  pdf.  We  show  that  the  approximation  visibly  simplifies 
the  decision  process  and  the  error  analysis.  The  latency  and  energy  consumption  estimation  are  also 
included  as  application  examples.  Simulations  show  that  our  approximation  model  can  predict  the 
latency  and  energy  consumption  with  less  than  half  RMSE,  compared  to  the  linear  models. 

I.  Introduction 

The  recent  advances  in  MEMS,  embedded  systems  and  wireless  communications  enable 
the  realization  and  deployment  of  wireless  sensor  networks  (WSN),  which  consist  of  a 
large  number  of  densely  deployed  and  self-organized  sensor  nodes  [1]— [3],  The  potential 
applications  of  WSN,  such  as  environment  monitor,  often  emphasize  the  importance  of 
location  information.  Fortunately,  with  the  advance  of  localization  technologies,  such  location 
information  can  be  accurately  estimated  [4]— [7].  Accordingly  geographic  routing  [8]— [10] 
was  proposed  to  route  packets  not  to  a  specific  node,  but  to  a  given  location.  An  interesting 


question  arises  as  “how  many  hops  does  it  take  to  reach  a  given  location?”  The  prediction  of 
the  number  of  hops  is  important  not  only  in  itself  but  also  in  helping  estimating  the  latency 
and  energy  consumption,  which  are  both  important  to  the  viability  of  WSN. 

The  question  could  become  very  simple  if  the  sensor  nodes  are  manually  placed.  However, 
if  sensor  nodes  are  deployed  in  a  random  fashion,  the  answer  is  beyond  the  reach  of  simple 
geometry.  The  stochastic  nature  of  the  random  deployment  calls  for  a  statistical  study. 

The  relation  between  the  Euclidean  distance  and  network  distance  (in  terms  of  the  number 
of  hops)  catches  a  lot  of  research  interest  recently.  In  [11],  Huang,  Lu  and  Roman  defined 
the  T-compactness  of  a  geometric  graph  G(V,  E )  to  be  the  minimum  ratio  of  the  Euclidean 
distance  to  the  network  distance, 


7  =  mm 

'  T  / 


d{i,j) 

i'je'v  h(i,j) 


(1) 


where  d(i,j)  and  h(i,j)  are  the  Euclidean  distance  and  network  distance  between  node  i 
and  j,  respectively.  The  constant  value  7  is  a  good  lower  bound,  but  might  not  be  enough  to 
describe  the  non-linear  relation  between  Euclidean  distance  and  network  distance.  Fortunately, 
a  lot  of  probabilistic  study  has  been  applied  to  this  question.  In  [12],  Hou  and  Li  studied 
the  2-D  Poisson  distribution  to  find  an  optimal  transmission  range.  They  found  that  the 
hop-distance  distribution  is  determined  not  only  by  node  density  and  transmission  range  but 
also  by  the  routing  strategy.  They  showed  results  for  three  routing  strategies,  Most  Forward 
with  Fixed  Radius,  Nearest  with  Forward  Progress,  and  Most  Forward  with  Variable  Radius. 
Cheng  and  Robertazzi  in  [13]  studied  the  one-dimension  Poisson  point  and  found  the  pdf 
of  r  given  the  number  of  hops.  They  also  pointed  out  the  2-D  Poisson  point  distribution  is 
analogous  to  the  1-D  case,  replacing  the  length  of  the  segment  by  the  area  of  the  range. 
Vural  and  Ekici  re-examined  the  study  under  the  sensor  networks  circumstances  in  [14], 
and  gave  the  mean  and  variance  of  multi-hop  distance  for  1-D  Poisson  point  distribution. 
They  also  proposed  to  approximate  the  multi-hop  distance  using  Gaussian.  Zorzi  and  Rao 
derive  the  mean  number  of  hops  of  the  minimal  hop-count  route  through  simulations  and 
analytic  bounds  in  [10].  Chandler  [15]  derives  an  expression  for  f-hop  outage  probability 
for  2-D  Poisson  node  distribution.  However,  Mukherjee  and  Avidor  [16]  argue  that  one  of 
Chandler’s  assumptions  is  flawed  and  thus  his  expression  is  in  fact  a  lower  bound  on  the 
desired  probability.  They  also  rigorously  derive  the  pdf  of  the  minimal  number  of  hops  for  a 
given  distance  in  a  fading  environment.  Although  the  exact  analytic  results  are  available  in 
the  literature,  their  monstrous  computational  complexity  limits  their  applications.  Therefore, 
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we  try  to  approximate  the  hop-distance  relation  and  simplify  the  decision  process  and  error 
analysis  in  this  paper.  Considering  the  application  of  resource  allocation,  only  large-scale 
path  loss  is  considered  and  thus  the  fading  is  ignored. 

The  rest  of  this  paper  is  organized  as  follows.  We  provide  some  preliminaries  on  skewness 
and  kurtosis  in  Section  II.  The  number  of  hops  predication  problem  is  addressed  and  solved 
in  Section  III.  Since  this  problem  has  no  closed-form  solution,  we  propose  an  attenuated 
Gaussian  approximation  and  show  how  to  simplify  the  error  analysis  in  Section  III-A. 
Application  examples  are  shown  in  Section  IV.  Section  V  concludes  this  paper. 


II.  Preliminaries  :Skewness  and  Kurtosis 

In  this  section,  we  provide  some  preliminaries  on  statistical  methods  [17].  Skewness  is  a 
measure  of  symmetry,  or  more  precisely,  the  lack  of  symmetry.  A  distribution,  or  sample  set, 
is  symmetric  if  it  looks  the  same  to  the  left  and  right  of  the  center  point. 

Definition  1:  [17]  For  a  given  sample  set  X, 

m3  =  E  {X-Xf/n,  (2) 

m2  =  E  (X-X)2/n,  (3) 


where  X  is  the  sample  mean  of  X,  and  n  is  the  size  of  X.  Then  a  sample  estimate  of 
skewness  coefficient  is  given  by 


9i  =  -j  •  (4) 

ml 

Skewness  is  zero  for  a  symmetric  distribution.  Positive  skewness  indicates  right  skewness 
and  negative  indicates  left. 

Kurtosis  is  a  measure  of  whether  the  data  are  peaked  or  flat  relative  to  a  normal  distribution. 
Definition  2:  [17]  A  sample  estimate  of  kurtosis  for  a  sample  set  X  is  given  by 


g2  =  m4/ml  -  3,  (5) 

where  m4  =  E(A  —  X)A /n  is  the  fourth-order  moment  of  X  about  its  mean. 

Skewness  and  kurtosis  is  useful  in  determining  whether  a  sample  set  is  normal.  Note  that 
the  skewness  and  kurtosis  of  a  normal  distribution  are  both  zero;  significant  skewness  and 
kurtosis  clearly  indicate  that  data  are  not  normal. 
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III.  Estimation  of  Network  Distance  Based  on  Euclidean  Distance 

Suppose  the  sensor  nodes  are  placed  on  a  plane  at  random  at  an  average  density  of  A 
nodes  per  square  meters.  Let  N(A )  be  the  number  of  nodes  in  area  A,  it  can  be  shown  that 
N(A)  is  a  two-dimensional  Poisson  point  process  with  density  A.  The  problem  of  interest  is 
to  find  the  number  of  hops,  denoted  H{  needed  to  reach  a  specific  destination  r  from  a  given 
source  node.  We  can  make  a  Maximum  Likelihood  (ML)  decision, 

H  =  arg  max  f(r\Hi),i  =  1,2,3,  •••  ,  (6) 

where  Hi  can  also  be  described  as  “the  minimum  number  of  hops  is  i  from  the  source  to 
the  specific  node  with  Euclidean  distance  r”.  In  the  following  discussion,  we  are  trying  to 
approximate  /(rjfV,)  for  2-D  Poisson  distribution.  Note  that  r  <  R  — >  H\,  we  are  more 
interested  in  multiple-hop  distance  relation,  especially  for  i  is  relatively  large. 


A.  Attenuated  Gaussian  Approximation 


TABLE  I 

Statistics  of  /(r|i?i) 


Number  of  Hops 

Mean 

Std 

Skewness 

Kurtosis 

1 

19.991 

7.0651 

-0.57471 

-0.58389 

2 

45.132 

7.8365 

-0.16958 

-1.0763 

3 

72.01 

8.2129 

-0.10761 

-1.0332 

4 

99.45 

8.391 

-0.07938 

-0.97857 

5 

127.14 

8.5323 

-0.06445 

-0.93104 

6 

154.96 

8.6147 

-0.05341 

-0.9004 

7 

182.68 

8.573 

-0.07738 

-0.91687 

Since  f{r\Hi)  is  awkward  to  evaluate  even  using  numerical  methods,  we  use  histograms 
collected  from  Monte  Carlo  simulations  as  substitute  to  the  joint  pdf.  All  the  simulation 
data  are  collected  from  such  a  scenario  that  N  sensor  nodes  were  uniformly  distributed  in 
a  circular  region  of  radius  of  Rsound  meters.  For  convenience,  polar  coordinates  were  used. 
The  source  node  was  placed  at  (0, 0).  The  transmission  range  was  set  as  R  meters.  For  each 
setting  of  ( N ,  R Bound,  R),  we  ran  300  simulations,  in  each  of  which  all  nodes  are  re-deployed 
at  random.  We  ran  simulations  for  extensive  settings  of  node  density  A  and  transmission  range 
R.  Due  to  space  constraints,  only  the  histograms  for  (TV  =  1000,  RBound  —  200,  R  =  30) 
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Fig.  1.  Histograms  of  hop-distance  joint  distribution.  ( N  —  1000,  R.Bound  =  200,  R  =  30) 


are  plotted  in  Fig.  1,  which  approximately  shows  that  f{r\Hi)  approach  the  normal  when 
Hi  increases.  Table  I  lists  the  first-,  second-,  third-  and  fourth-order  statistics  of  f(H,r). 
The  skewness  and  kurtosis  clearly  satisfy  the  Gaussianity  condition  within  tolerance  of  error. 
Furthermore,  The  postulated  distribution  and  histogram  are  drawn  together  in  Fig.  2  (d)(e)(f), 
which  clearly  shows  a  close  match  for  each  case. 

Thus,  the  objective  function  can  be  approximated  by 


anN{ 

an 

- — e 
2na 


^11!  &Tl) 
(r-mn)2 


(7) 


where  a  is  the  equivalent  attenuation  base,  mn  and  an  are  the  mean  and  standard  de- 
viation(std),  respectively.  The  specific  values  of  these  parameters  can  be  estimated  from 
simulations.  Our  extensive  simulations  show  that,  even  for  only  relatively  large  Hit  f{r\Hi ) 
has  following  properties, 

1)  crn  «  <7„_i,  which  means  the  neighboring  joint  pdfs  have  similar  spread. 

2)  mn  -  mn_!  ft  mn+i  —  mn,  which  means  the  joint  pdfs  are  evenly  spaced. 

3)  3  <  <  5,  which  means  the  overlap  between  the  neighboring  joint  pdfs  is 

small  but  not  negligible.  (As  a  rule  of  thumbs,  Q( 3)  is  considered  relatively  small  and 
Q( 5)  is  regarded  negligible.) 
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Fig.  2.  The  histogram  vs.  postulated  distribution  for  end-to-end  distances  for  given  number  of  hops,  (a)  Three-hop.  (b) 
Four-hop.  (c)  Five-hop.  (d)  Six-hop. 


4)  mn~mn~2  5,  which  means  the  overlap  between  the  non-neighboring  joint  pdf’s  is 
negligible. 

5)  a  <  1.  For  large  density  A,  a  — »  1.  Along  with  Property  1,  this  tell  us  that  the 
neighboring  joint  pdfs  have  nearly  identical  shape. 

As  shown  in  the  following  discussion,  these  properties  largely  simplify  the  decision  rule 
and  the  error  analysis.  Another  interesting  observation,  besides  these  properties,  is  that  the 
following  equations  do  not  stand  true. 

mn  —  nrrii 
mn  =  nR 
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(8) 

(9) 


=  (n-l)R  +  R/2 


(10) 


mn 


Although  these  equations  sound  plausible,  they  all  give  visible  errors.  The  aforementioned 
estimator  [r/R]  +  1  for  Hl,  though  widely  used,  is  not  good  in  the  new  light  shed  by  this 
study. 


B.  Decision  Boundaries 


Fig.  3.  Gaussian  Approximation. 


Following  (6),  and  observe  the  f(r\Hi)  in  Fig.  3,  the  decision  is  needed  only  between 
neighboring  Hi}  that  is, 

f(r\Hn)  |  f(r\Hn  +  l).  (11) 

n+l 

This  is  because,  for  a  specific  value  of  r,  there  are  only  two  values  of  Hi  with  dominating 
f(r\Hi),  compared  to  which  f(r\Ht)  for  other  values  of  Hi  is  negligible.  Substitute  (7)  into 
(11),  we  obtain  the  decision  boundary  dn  between  the  regions  Hn  and  Hn  +  1. 

^  _  B  +  y/B2  +  AC) 

dn  -  A 

A  =  al+1-ol 
B  =  mnal+1  -  mn+lal 

C  =  rn2nal+1-m2n+1(jl  +  2alal+1\na  (12) 
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Using  Property  1 


.  _  ml+i  -m2n-2al\na 

U>r> 


2(mn+i  -  mn) 

For  large  density  A,  Property  5  is  applicable,  (12)  simplifies  to 


j  olmn+ 1  +  al+1mn 
dn  ~ 


Applying  Property  1  to  (14), 


dn  — 


<  +  °n+l 


mn  +  mn+ 1 


(13) 


(14) 


(15) 


No  matter  which  approximate  solution  we  choose  for  dn,  the  decision  rule  is  given  by 


n+1 

r  ^  dn. 

n 


(16) 


In  other  words, 


we  decide  n  if  d„_i  <  r  <  d* 


(17) 


C.  Error  Performance  Analysis 

For  our  decision  rule,  a  decision  error  occurs  only  when  Hn  but  we  decide  hfn.  Thus, 
the  probability  of  error  for  a  specific  r  is 


p(e|r)  = 

7i  /n 


(18) 


where  f(H\r)  is  related  to  /(r|  /f,)  by  the  Bayesian  rule.  The  total  probability  of  error  is 
obtained  by  integrating  (18)  over  all  possible  r. 


P(t)  =  J  pfV)fv{r)dr 


(19) 


According  to  Property  4,  only  f(r\H  =  n  -  1)  and  f{r\H  =  n  +  1)  could  have  outstanding 
value  over  the  decision  region  [dn_x,dn], 

oo  ^7l.. 

p(e)  ~  /  f(r \H  =  n-  1  )p(Hn  -  1)  +  f{r\Hn  +  1  )p(Hn  +  1  )dr 

n= 2  r 


—  1 


dpi- 1  r\(dn  ^7-n— 1 ' 


n=2 


^71—  1 


L)-Q(- 


+a”+1p(Ff„  +  l)[Q(mn+1  dn)  -  Q( 


0"n— 1 
7?7-n_j_i  dn— 


^n-f  1 


^n+l 


(20) 
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Note  that 


dn—i  Tfi-n—  i 

&n— 1 


dn  mn—i 
&n- 1 

dn  -  dn~1  »  1,  (21) 

0?1—  1 


therefore,  is  negligible  compared  to  Q(dn  gn^'~ )•  Similarly,  Q(m'^1+idn)  is 

negligible.  (20)  is  approximated  by 


Ke) 


1,3  is 

+a“+V(fl„  +  IWt”1^1  )1 

0*71+1 

02 

71 — 13 

+Q(d!L=m2i)] 

0*71 


(22) 


Substituting  an  appropriate  solution  of  into  (22)  would  give  us  the  probability  of  error 
within  required  accuracy.  For  example,  if  we  choose  (15), 


P(e) 


2<t2 

+Q(m"-+j-~— )J 


71=3 


TTiji  s 

2cr„ 


(23) 


IV.  Application  Examples 

We  provide  two  application  examples,  latency  and  energy  estimation,  in  this  section.  To 
emphasize  the  role  of  the  number  of  hops  in  the  estimation,  we  use  general  time  and  energy 
models.  On  how  to  derive  the  parameters  such  as  Trx,  Ttx for  a  specific  routing  scheme,  readers 
are  referred  to  [18],  [19]. 

A.  Latency  Estimation 

We  use  a  simple  time  model,  in  which  the  latency  increases  linearly  with  the  number 
of  hops  [20],  Suppose  it  takes  Trx,Ttx  for  a  sensor  node  to  process  1  bit  of  incoming  and 
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r  x 


if 

Fig.  4.  Time  model. 


outgoing  message,  respectively.  And  Tpr  is  the  required  time  to  transmit  1  bit  of  message 
through  a  band-limited  channel.  Therefore,  the  latency  introduced  for  each  hop  is 

Thop  —  Ttx  +  Tpr  +  Trx  (24) 

Shown  in  Fig.  4,  given  the  end-to-end  distance  r,  we  can  find  the  required  number  of  hops 
h  according  to  (16),  thus,  a  good  estimator  of  the  total  latency  of  a  /-bit  message  is 

lhThop  (25) 


B.  Energy  Consumption  Estimation 


The  following  model  is  adopted  from  [21]  where  perfect  power  control  is  assumed.  To 
transmit  /  bits  over  distance  r,  the  sender’s  radio  expends 


Etx(l,r ) 

and  the  receiver’s  radio  expends 


l^elec  Itfs'T'  r*  <C  To 

< 

IE  elec  ^  ^0 


(26) 


Erx  (Z  >  0  —  tEeif>c . 


(27) 


Eeiec  is  the  unit  energy  consumed  by  the  electronics  to  process  one  bit  of  message,  tfs  and 
emp  are  the  amplifier  factor  for  free-space  and  multi-path  models,  respectively,  and  d0  is  the 
reference  distance  to  determine  which  model  to  use.  The  values  of  these  communication 
energy  parameters  are  set  as  in  Table  II. 

Let  sn  denote  the  single-hop  distance  from  the  (n  —  l)th-hop  to  the  nth-hop.  Obviously, 
sn  <  R.  In  our  experimental  setting,  R  =  30m  <  do  so  that  the  free  space  model  is  always 
used.  This  agrees  well  with  most  applications,  in  which  multi-hop  short-range  transmission 
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TABLE  II 


Energy  Consumption  Parameters 


Name 

Value 

To 

86.2m 

Eelec 

50  nJ/bit 

Eda 

5nJ/bit 

ej3 

lOpJ/bit/m 2 

£mp 

0.0013 pJ/bit/m4 

is  preferred  to  avoid  the  exponential  increase  in  energy  consumption  for  long-range  trans¬ 
mission.  Naturally,  the  end-to-end  energy  consumption  for  sending  1  bits  over  distance  r  is 
given  by 

n 

Etotal{l,r)  —  +  Erx(l)}  (28) 

1 

where  n  is  the  estimated  number  of  hops  for  given  r  and  ?’i  is  the  single-hop  distance  because 
the  message  is  relayed  hop  by  hop. 

On  the  average, 

Etotali^f')  ~  ^{,E elec -\~  C f sE\v +  Eeiec) 

=  nl(2Eeiec  +  efs(ml  +  of)  (29) 


C.  Simulation 

We  used  the  same  scenario  described  in  Section  III-A  and  varied  the  node  density  A  and 
transmission  range  R.  In  each  simulation,  the  number  of  hops  is  estimated  for  each  node 
using  (14)  and  (16),  and  then  the  latency  and  energy  consumption  are  estimated  using  (25) 
and  (29),  respectively.  As  comparison  to  our  proposed  statistic-based  estimator,  we  chose  a 
widely  used  linear  estimator. 

Linear  Estimator  1  n  =  [r/R]  +  1, 

Linear  Estimator  2  h—  [r/R]  +  2,  (30) 

where  r  is  the  given  distance,  R  the  transmission  range  and  [r/R]  is  the  maximum  number 
less  than  r/R.  We  plot  the  average  of  latency  and  energy  consumption  in  Fig.5(a)  (b)  and 
the  RMSE  in  Fig. 6(a)  (b),  respectively.  The  latency  is  plotted  in  units  of  Thop  while  the 
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energy  consumption  in  units  of  Joules.  The  ripple  shape  of  RMSE  is  due  to  the  fact  decision 
errors  occurs  more  often  in  the  overlapping  zones  of  neighboring  f(r\Hi).  Fig. 5  show  that 
the  linear  estimator  1  performs  well  at  the  shorter  range  but  suffers  visibly  at  larger  range, 
while  the  linear  estimator  does  the  opposite.  The  linear  estimators,  no  matter  what  value 
their  parameters  take,  may  significantly  underestimate  or  overestimate  the  latency  and  energy 
consumption  as  already  pointed  out  in  Section  III-A,  while  our  statistic-based  model  keeps 
close  to  the  actual  latency  and  energy  consumption  at  all  ranges  except  for  the  border.  This 
is  also  verified  by  Fig.6,  which  also  shows  that  our  model  can  reduce  RMSE  to  at  least 
half  for  both  latency  and  energy  consumption.  These  results  show  that  linear  models  cannot 
identify  network  behavior  accurately,  as  also  confirmed  by  our  extensive  simulations  for 
different  settings  of  node  density  and  transmission  range,  which  is  not  shown  here  due  to 
space  constraints. 


Fig.  5.  Estimation  Average,  (a)  Latency,  (b)  Energy  consumption. 


V.  Conclusion 

To  address  the  fundamental  problem  “how  many  hops  does  it  take  for  a  packet  to  be 
relayed  for  a  given  distance?”,  we  make  both  probabilistic  and  statistic  studies.  We  proposed 
a  Bayesian  decision  based  on  the  conditional  pdf  of  f(r\Hi).  Since  f(r\Hi)  is  computationally 
complex,  we  also  proposed  an  attenuated  Gaussian  approximation  for  the  conditional  pdf, 
which  visibly  simplifies  the  decision  process  and  the  error  analysis.  We  also  show  that  several 
linear  models,  though  intuitively  sound  and  widely  used,  may  give  significant  bias  error. 
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Fig.  6.  Estimation  RMSE.  (a)  Latency,  (b)  Energy  consumption. 


We  apply  our  approximation  to  the  latency  and  energy  consumption  estimation  in  dense 
WSN.  Simulations  show  that  our  approximation  model  can  predict  the  latency  and  energy 
consumption  with  less  than  half  RMSE,  compared  to  the  aforementioned  linear  models. 
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Abstract — We  model  the  end-to-end  distance  for  a  given 
number  of  hops  in  dense  planar  Wireless  Sensor  Networks  in 
this  paper.  We  derive  that  the  closed-form  formula  for  single¬ 
hop  distance  and  postulate  Beta  distribution  for  2-hop  distance. 
When  the  number  of  hops  increases  beyond  three,  the  multi¬ 
hop  distance  approaches  Gaussian.  The  Gaussian  approximation 
model  is  also  applied  to  ranging,  which  achieves  less  distance 
error  than  Hop-TERRAIN  and  APS  (Ad  hoc  Positioning  System). 
Our  error  analysis  also  shows  the  distance  error  is  be  minimized 
by  using  our  model. 

I.  Introduction  and  Motivation 

In  Wireless  Sensor  Networks  (WSN),  knowledge  of  node 
location  is  often  required  in  many  applications.  The  examples 
include  events  report,  target  tracking,  geographical  routing, 
and  coverage  evaluation.  Generally,  the  distances  from  a  node 
with  unknown  location  to  several  anchor  nodes  are  estimated, 
and  then  a  multilateration  is  applied  to  estimate  the  node 
location.  Distance  is  often  estimated  based  on  received  signal 
strength,  time  of  arrival  (TOA),  time  difference  of  arrival 
(TDOA)  or  angle  of  arrival  [1],  The  angle-of-arrival  based 
ranging  requires  directive  antennas  or  arrays,  which  is  not 
suitable  for  most  microsensors.  Similarly,  measuring  time  of 
flight  requires  timing  device  with  satisfactory  resolution  like 
in  GPS.  Although  TDOA  needs  much  less  resolution,  it  often 
requires  extra  acoustic  or  ultrasound  emission,  which  comes 
with  higher  price,  larger  size  and  more  energy  consumption,  all 
seeming  impractical  for  microsensors.  Thus,  most  technically 
available  ranging  is  based  on  received  signal  strength;  in  fact, 
RSSI  (Received  Signal  Strength  Indication)  is  widely  used  in 
wireless  communications  to  provide  distance  estimation. 

The  underlying  observation  is  that  the  average  large-scale 
path  loss  can  be  expressed  as  a  function  of  distance  by  using 
a  path  loss  exponent,  n  [2]. 

PL(d)  =  PL(do)(~)n  (1) 

d0 

where  n  is  the  path  loss  exponent,  which  indicates  the  rate  at 
which  the  path  loss  increases  with  distance,  do  is  the  close- 
in  reference  distance,  which  is  determined  from  measurement 
close  to  the  transmitter,  and  d  is  the  distance  from  the  source 
to  the  receiving  point.  Measurements  have  also  shown  that  at 
any  value  of  d,  the  path  loss  PL(d)  at  a  particular  location 
is  random  and  distributed  log-normally  (normal  in  dB)  about 


the  mean  distance-dependent  value. 

PL{d)  [dB]  =  PL(d )  [dB]  +  ,  (2) 

where  Xa  is  a  zero-mean  Gaussian  distributed  random  variable 
(in  dB)  with  standard  deviation  a  (also  in  dB).  The  log-normal 
shadowing  is  the  main  source  of  distance  error  for  received- 
signal-strength-based  ranging  methods.  The  values  of  n  and 
a  are  often  estimated  empirically,  for  example,  n  could  vary 
from  2  to  10  for  different  environments,  and  typical  value  of 
a  in  urban  area  is  around  10  dBs. 

Due  to  the  log-normal  shadowing,  the  RSS-based  ranging 
could  be  very  rough.  For  example,  the  median  localization 
error  of  commodity  802.11  technology  is  10/t  [3];  such  ac¬ 
curacy  may  be  achieved  by  alternative  techniques,  for  example, 
exploiting  the  dense  deployment  to  estimate  distance  between 
nodes.  For  those  applications  where  the  sensor  nodes  are  over- 
densely  deployed,  the  distance  between  the  nodes  are  short 
and  the  variance  of  such  distance  is  also  small.  Therefore,  it 
is  quite  promising  to  estimate  the  end-to-end  distance  based 
on  the  number  of  hops  [4]-[6]. 

For  example,  both  APS  [4]  and  Hop-TERRAIN  [6]  find  the 
number  of  hops  from  a  node  to  each  of  the  anchors  and  then 
multiply  this  hop  count  by  a  shared  metric  (average  single¬ 
hop  distance)  to  estimate  the  range  between  the  node  and 
each  anchor.  The  known  positions  of  anchor  nodes  and  these 
computed  ranges  are  then  used  to  perform  a  triangulation  to 
obtain  estimated  node  positions.  A  further  refinement  phase 
is  proposed  in  [6],  which  uses  least  squares  on  local  com¬ 
putation.  However,  as  we  show  later,  the  distance  does  not 
increase  linearly  with  the  number  of  hops.  Therefore,  a  better 
knowledge  about  the  hop-distance  relationship  can  cast  new 
light  on  distance  estimation. 

In  [7],  Hou  and  Li  studied  the  2-D  Poisson  distribution  to 
find  an  optimal  transmission  range.  They  found  that  the  hop- 
distance  distribution  is  determined  not  only  by  node  density 
and  transmission  range  but  also  by  the  routing  strategy.  They 
showed  results  for  three  routing  strategies,  Most  Forward 
with  Fixed  Radius,  Nearest  with  Forward  Progress,  and  Most 
Forward  with  Variable  Radius.  Cheng  and  Robertazzi  in  [8] 
studied  the  one-dimension  Poisson  point  and  found  the  pdf 
of  r;  and  the  dependency  of  rr  on  previous  rj,  j  <  i. 
They  also  pointed  out  the  2-D  Poisson  point  distribution  is 
analogous  to  the  1-D  case,  replacing  the  length  of  the  segment 


by  the  area  of  the  range.  Vural  and  Ekici  reexamined  the 
study  of  1-D  Poisson  distribution  under  the  sensor  networks 
circumstances  in  [9],  and  proposed  to  approximate  the  multi¬ 
hop  distance  using  Gaussian.  In  this  paper,  we  study  the  hop- 
distance  relation  in  the  planar  WSN. 

The  rest  of  this  paper  is  organized  as  follows.  We  provide 
some  preliminaries  on  statistical  methods  in  Section  II.The 
probabilistic  study  is  presented  in  Section  III  and  statistical 
analysis  in  Section  IV.  Section  V  concludes  this  paper. 

II.  Preliminaries 

In  this  section,  we  provide  some  preliminaries  on  statistical 
methods  [10]. 

A.  Skewness  and  Kurtosis 

Skewness  is  a  measure  of  symmetry,  or  more  precisely,  the 
lack  of  symmetry.  A  distribution,  or  sample  set,  is  symmetric 
if  it  looks  the  same  to  the  left  and  right  of  the  center  point. 
Definition  1:  [10]  For  a  given  sample  set  X, 

m3  =  E(X-X)3/n ,  (3) 

m2  =  E  {X-X)2/n,  (4) 

where  X  is  the  sample  mean  of  X,  and  n  is  the  size  of  X. 
Then  a  sample  estimate  of  skewness  coefficient  is  given  by 

_  ^3  fc\ 

9 1  =  —  •  (5) 

m2 

Skewness  is  zero  for  a  symmetric  distribution.  Positive  skew¬ 
ness  indicates  right  skewness  and  negative  indicates  left. 

Kurtosis  is  a  measure  of  whether  the  data  are  peaked  or  flat 
relative  to  a  normal  distribution. 

Definition  2:  [10]  A  sample  estimate  of  kurtosis  for  a 

sample  set  X  is  given  by 

<?2  =  rrii/ml  -  3,  (6) 

where  mu  =  E(Af  -  X)4/n  is  the  fourth-order  moment  of  X 
about  its  mean. 

Skewness  and  kurtosis  is  useful  in  determining  whether  a 
sample  set  is  normal.  Note  that  both  the  skewness  and  kurtosis 
of  a  normal  distribution  are  zero;  significant  skewness  and 
kurtosis  clearly  indicate  that  data  are  not  normal. 

B,  Chi-Square  Test 

Chi-square  test  is  widely  used  to  determine  the  goodness  of 
fit  of  a  distribution  to  a  set  of  experimental  data.  It  works  as 
follows: 

.  1.  Partition  the  sample  space  into  the  union  of  K  disjoint 
intervals. 

•  2,  Compute  the  probability  bk  that  an  outcome  falls  in  the 
/cth  interval  under  the  postulated  distribution.  The  mk  = 
nbk  is  the  expected  number  of  outcomes  that  fall  in  the 
/cth  interval  in  n  repetitions  of  the  experiment. 

•  3.  The  chi-square  statistic  is  defined  as  the  weighted 
difference  between  the  observed  number  of  outcomes, 


A/fc,  that  fall  in  the  /cth  interval,  and  the  expected  number 


mk. 


=  ££=1 


K  (Nk  -  ™*:)2 


mk 


(7) 


•  4.  The  hypothesis  is  rejected  if  D 2  >  ta,  where  ta 
is  a  threshold  determined  by  a  given  significance  level. 
Otherwise,  the  fit  is  considered  good. 


III.  Modeling  End-to-end  Distance  for  Given 
Number  of  Hops 

A.  Problem  Formidation 

We  assume  a  general  beacon  scenario,  in  which  anchors 
sends  out  beacon  packets  informing  other  nodes  about 
their  locations.  These  beacon  packets  are  also  relayed  so 
that  nodes  outside  the  anchors’  transmission  range  could 
also  received  the  beacons.  Suppose  the  sensor  nodes  are 
placed  on  a  plane  at  random  at  an  average  density  of  A 
nodes  per  square  meters.  Nonetheless,  clarifications  about 
several  terms  are  necessary,  because  they  have  been  used 
in  a  wide  variety  of  senses. 

Firstly,  our  study  on  end-to-end  distance  for  given  number 
of  hops  is  based  on  local  coordinate  system,  which 
could  be  translated  into  a  global  coordinate  system  if 
enough  nodes  in  the  local  coordinate  system  have  known 
global  coordinates.  In  previous  research,  anchors  refer 
to  beacons,  whose  locations  are  known  and  broadcast  to 
other  nodes.  However,  in  our  study,  an  anchor  is  simply 
a  specific  node  used  in  establishing  the  local  coordinate 
system.  An  anchor  could  have  global  coordinates  or  not, 
which  is  of  no  interest  to  our  study.  Therefore,  our 
study  is  applicable  to  both  anchor-based  and  anchor-free 
approaches. 

Secondly,  we  assume  the  beacon  packets  are  distributed 
in  an  ad  hoc  fashion.  Although  better  routing,  such  as 
geographic  routing,  are  proposed  for  WSN,  they  are  not 
suitable  for  relaying  beacon  packets,  because  during  this 
phase,  most  nodes  have  no  knowledge  about  locations 
of  their  own  and  neighbors’.  Under  such  circumstances, 
we  have  to  assume  the  beacon  packets  are  simply  flooded 
throughout  the  sensor  network,  except  that  nodes  can  only 
relay  the  beacon  packets  incoming  with  least  number  of 
hops  and  discard  those  via  more  hops. 

Let  N(A)  be  the  number  of  nodes  in  area  A,  it  can  be 
shown  that  N(A)  is  a  two-dimensional  Poisson  point 
process  with  density  A.  One  property  of  the  Poisson 
process  is  that  if  the  number  of  nodes  occurring  in  the 
area  A  is  N,  then  the  individual  outcomes  are  distributed 
independently  and  uniformly  in  the  area  A.  That  is,  if 
N  nodes  are  placed  at  random  in  the  area  A,  then  the 
probability  of  a  specific  node  in  the  subarea  B  is  B/A, 
given  B  e  A. 

Assume  the  area  A  is  large  enough  so  that  none  of  the 
anchor  nodes  is  near  the  border  and  the  transmission 
range  is  R.  The  problem  of  interest  is  to  find  the  distance 
from  a  specific  node  to  the  anchor  given  this  node  is 
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TABLE  I 

Definition  of  Variables 


(16) 


Variable 

Definition 

f  -  (r,  6) 

the  polar  coordinates  of  a  node 

u 

the  distance  from  the  ( i  —  l)-hop  node  to  the  i-hop  node 

Hi 

the  event  “the  specific  node  is  within  i  hops,  but  beyond  (z  -  1) 
hops  from  the  source.” 

within  i  hops  from  the  anchor.  The  definitions  of  variables 
are  listed  in  Table  I.  Note  that  the  event  Hi  can  also 
be  described  as  “the  minimum  number  of  hops  from  the 
anchor  to  the  specific  node  is  i". 

B.  Single-Hop  Case 

Consider  the  first  hop  case,  the  conditional  cdf  can  be 
expressed  by 

P[n  <  r\Hi]  =  P[r i  <  r|n  <R\  =  ^  (8) 

Taking  derivative, 

/( r1|Hi)W  =  f  (9) 

And  the  conditional  mean  and  variance  are  277/3  and 
R?/ 18,  respectively,  which  are  solely  determined  by 
the  transmission  range  R  and  irrelevant  to  the  node 
distribution  density  A.  This  is  due  to  the  uniform  node 
distribution;  no  matter  how  large  the  density  could  be, 
it  would  not  give  any  bias  to  the  conditional  mean  and 
variance. 

C.  Two -Hop  Case 

Conditional  on  the  value  of  r  j,  the  cdf  for  £2  is 

P{t2<t\r1\H2)=-Jjp,  (10) 

where  B  is  the  area  of  the  region  inside  the  circle  of 

center  fi)  but  outside  the  circle  of  center  fib.  B  is  equal 

to 

m(t2)2  -  (ii)2(^i  -  ^  sin  20i )  -  (f2)2(0 2  -  ^sin2<£2), 

(11) 

where 

h  =  eos-^l-^),  (12) 

fa  =  cos_1(^-).  O3) 

The  conditional  pdf  of  f2  is  obtained  by  taking  the 
derivative  of  (10). 

/t2|r,|H2W  =  ^^2>  (14) 

By  taking  expected  value  of  (14), 

WO-f  05) 


7-2  is  determined  by 

r2  =  y/ (fi)2  +  (t2)2  -  2*1*2  cos  fa 


where  <p  is  the  angle  between  £1  and  £2  and  uniformly 
distributed  in  [-fa,  fa}.  Although  it  is  possible  to  derive 
the  pdf  of  r2  from  (16),  it  is  awkward  to  evaluate 
explicitly.  Furthermore,  note  that  rn  depends  on  rn_i, 
a  nested  integral  as  in  (17)  is  generally  required  for  such 
evaluations.  Thus,  for  the  end-to-end  distance  for  two  and 
more  hops,  we  will  postulate  their  distribution  from  the 
collected  simulation  data  in  the  next  section. 


p(rn\H) 


(n—l)R(n—2)R  R 


/  /  "'/P(rnlri’r2’"' 

R  R  0 

/(r„-i|ri,r2,---  ,rs_2,77)  •  •  • 

f{ri\H)dri  ■  ■  •  drn_2drn_i  (17) 


Fig.  1.  Two-hop  distance. 


IV.  Statistical  Analysis 

All  the  simulation  data  are  collected  from  such  a  scenario 
that  N  sensor  nodes  were  uniformly  distributed  in  a 
circular  region  of  radius  of  300  meters.  For  convenience, 
polar  coordinates  were  used.  The  anchor  node  was  placed 
at  (0, 0).  We  ran  simulations  for  extensive  settings  of  node 
density  A  and  transmission  range  R.  And  for  each  setting 
of  (N,  R),  we  ran  300  simulations,  in  each  of  which  all 
nodes  are  re-deployed  from  the  beginning. 

A.  Single-Hop  Distance 

We  plot  the  histogram  of  single-hop  distance  collected 
from  simulations  and  compare  with  the  theoretical  result 
(9)  in  Fig.  2  (a),  which  clearly  shows  that  (9)  fits  the 
experimental  data  very  well.  Furthermore,  a  chi-square 
test  was  carried  out  to  determine  the  goodness  of  fit  of 
(9)  to  the  experimental  data.  The  threshold  for  30- 1  =  29 
degrees  of  freedom  at  a  1%  significance  level  is  49.59. 
Compared  to  this,  D 2  =  28.8728  is  well  within  the 
threshold.  Thus,  we  establish  that  the  data  is  in  good 
agreement  with  (9). 
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Fig.  2.  The  histogram  vs.  postulated  distribution  for  end-to-end  distances  for 
given  number  of  hops,  (a)  One-hop.  (b)  Two-hop.  (c)  Three-hop.  (d)  Four-hop. 
(e)  Five-hop.  (f)  Six-hop. 


B.  Two-Hop  End-to-end  Distance 

Since  there  is  no  closed-form  formula  for  the  conditional 
pdf  of  end-to-end  distance  for  two  and  more  hops,  we 
have  to  find  a  fit  for  it.  We  postulate  the  following  pdf 
for  the  conditional  pdf  of  two-hop  end-to-end  distance 
according  to  the  experimental  data  plotted  in  Fig.  2  (b), 
whose  characteristic  curve  clearly  shows  a  Beta  distribu¬ 
tion  shape.  The  general  pdf  of  Beta  distribution  is 

fx(x)  =  C(x-ay-1(b-x)‘>-1,  (18) 


where  p  and  q  are  the  shape  parameters,  a  and  b  are  the 
lower  and  upper  bounds,  and  C  is  a  numerical  factor  to 
make  the  complete  probability  one.  The  bounds  a  and  b 
can  be  easily  determined  as  a  =  0  and  b  =  2 R.  Since 
the  maximum  of  (18)  occurs  at  b(p  -  l)/(p—  1  +  q—  1), 
which  is  at  2>R/2  in  Fig.  2  (b),  therefore,  p  =  4  and  q  =  2 
would  be  a  good  guess.  The  remaining  parameter  C  is 
determined  by 


f2R  (2R-s)s* 

Jr  B(4,2)(2  Ry 


(19) 


The  postulated  Beta  distribution  and  histogram  are  drawn 
together  in  Fig.  2  (b),  which  clearly  shows  a  close  match. 


TABLE  II 

Means  and  Stds  for  Three- And-More-Hop  End-to-end 
Distances 

Number  of  Hops  |  Mean  |  Sid  |  Skewness  [  Kurtosis 


3 

72.01 

8.2129 

BUMS 

-1.0332 

4 

99.45 

8.391 

-0.079383 

-0.97857 

5 

127.14 

8.5323 

-0.064453 

-0.93104 

6 

154.96 

8.6147 

-0.053416 

-0.9004 

(See  Fig.  2  (d)(e)(f)).  For  a  more  formal  analysis  about 
its  Gaussianity,  we  list  their  skewness  and  kurtosis  in 
Table  II.  Note  that  both  skewness  and  kurtosis  are  well 
within  tolerance,  we  postulate  Gaussian  distribution  for 
three-and-more-hop  end-to-end  distance.  The  mean  and 
std  can  be  estimated  from  the  experimental  data  (see 
Table  II).  We  plot  the  postulated  Gaussian  distribution 
and  histogram  together  in  Fig.  2  (d)(e)(f),  which  clearly 
show  a  close  match  for  each  case. 

D.  Optimum  Estimation  and  Error  Analysis 

Once  the  condition  pdf  is  known,  the  distance  estimation 
is  straightforward.  The  optimum  unbiased  estimator  is 
E\rn\Hn],  and  accordingly,  the  RMSE  can  be  minimized 
to  y/VAR[rn\Hn}.  In  APS/Hop-TERRAIN,  the  distance 
is  assumed  to  increase  linearly  with  the  number  of  hops, 
and  thus,  a  linear  estimator,  n  *  ml,  is  used  to  estimate 
nth-hop  distance.  Accordingly,  the  MSE  of  the  linear 
estimator  is  given  by 

MSE(Hn)  =  %-nm,)2] 

=  VAR[rn\Hn]  +  (mn-nm1  )2(20) 

The  minimum  RMSE  and  the  biased  RMSE  given  by 
APS/Hop-TERRAIN  estimator  are  plotted  together  in 
Fig.3,  which  increases  drastically  even  when  n  is  only 
moderately  large.  As  discussed  in  the  Introduction,  there 


Fig.  3.  The  RMSE  bias  vs.  the  number  of  hops.  R=30m 


C.  Three-And-More-Hop  End-to-end  Distance 

When  the  number  of  hops  increases  beyond  three,  the 
end-to-end  distance  distribution  approaches  Gaussian 
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exists  a  lower  bound  of  distance  error  for  the  RSS- 
based  ranging  technology.  According  to  [3],  the  median 
localization  error  of  commodity  802.11  technology  is 


10/i  «  3.05m.  The  RMSE  we  obtain  in  our  simulations 
is  around  8  meters,  which  is  in  the  same  order  of  mag¬ 
nitude  as  the  distance  error  bound  in  [3].  Furthermore, 
in  environment  with  irregular  terrain,  obstacles  or  other 
clutters,  the  shadowing  effect  may  cause  higher  distance 
error.  Since  hop-based  distance  technology  is  immune  to 
shadowing  effect,  it  may  outperform  RSS-base  ranging 
in  these  kinds  of  environment. 

V.  Conclusions 

In  this  paper,  we  study  the  modeling  of  the  end-to- 
end  distance  for  given  number  of  hops  in  WSN.  The 
experiments  showed  that  the  distance  does  not  increase 
linearly  with  the  number  of  hops.  Therefore,  the  distance 
should  be  analyzed  for  each  number  of  hops.  We  derived 
the  distribution  for  single-hop  distance  and  also  showed 
that  the  complexity  of  derivation  for  multiple-hop  dis¬ 
tance  is  beyond  practical  interest.  Thus,  we  postulate  Beta 
distribution  for  two-hop  end-to-end  distance  and  Gaussian 
distribution  for  three-and-more-hop  end-to-end  distance. 
Computer  simulations  showed  our  postulated  distribu¬ 
tions  agree  well  with  the  histograms.  We  also  show  that 
the  distance  error  can  be  minimized  by  exploiting  the 
distribution  knowledge. 
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Abstract — In  this  paper,  we  propose  a  MAC  protocol: 
throughput-maximized  MAC  protocol  (TM-MAC),  based  on  the 
characteristics  of  ultra  wideband  (UWB)  technology.  In  UWB 
communication  systems,  the  transmission  parameters  are  tun¬ 
able  to  match  the  requirements  of  data  flow.  In  TM-MAC, 
we  implement  concurrent  multiuser  access  instead  of  mutual 
exclusion  method,  such  as  TDMA  and  random  access.  For 
multiuser  interference,  we  establish  a  model  to  adaptively  adjust 
the  data  transmission  rate  to  ensure  a  satisfied  signal  to  noise 
ratio  (SNR)  at  receiver  side.  We  also  analyze  the  relationship 
among  the  theoretical  maximum  channel  capacity,  the  achievable 
maximum  channel  capacity  and  the  maximum  data  transmission 
rate.  According  to  the  network  topology,  TM-MAC  re-divides 
network  into  subsets,  in  which  communication  pairs  can  make 
communication  simultaneously  to  enhance  throughput  and  to 
exploit  as  fast  as  possible  data  transmission  rate  for  reliable 
communication.  For  subset  formation,  we  propose  a  general 
analytical  framework,  which  captures  the  unique  characteristics 
of  shared  wireless  channel  and  allows  to  model  a  large  class  of 
systemwide  throughput  maximization  issue  via  the  specification  of 
per-link  utilization  functions.  Simulation  results  demonstrate  that 
TM-MAC  can  implement  throughput  maximization  to  shorten 
latency  and  to  enhance  network  processing  capability. 

I.  Introduction 

A  wireless  sensor  network  (WSN)  can  be  thought  as  an 
ad  hoc  network  consisting  of  sensor  nodes  linked  by  a  wire¬ 
less  medium  to  perform  distributed  sensing  tasks.  Distributed 
WSNs  have  increasing  potential  applications  because  they  hold 
the  potential  to  revolutionize  many  segments  of  our  economy 
and  life,  from  environmental  monitoring  and  conservation  to 
manufacturing  and  business  asset  management.  Ultra  wide¬ 
band  (UWB)[1]  is  an  attractive  technology  for  WSNs  due  to 
its  extremely  high  data  transmission  rate,  low  radiated  power, 
accurate  range  resolution,  precision  distance  and/or  positioning 
measurement  capabilities,  relatively  immune  to  multipath  can¬ 
cellation  effects,  as  well  as  simple  UWB  transceiver  devices. 

As  a  general  principle,  the  role  of  medium  access  control 
(MAC)  module  is  to  allow  multiple  users  to  share  a  common 
resource.  Most  of  existed  wireless  MAC  protocols  assume 
that  simultaneous  transmissions  result  in  transmission  errors 
and  thus  employ  mutual  exclusion  mechanisms  to  avoid  them. 
Moreover,  from  layered  architecture  aspect,  the  functions  exe¬ 
cuted  by  MAC  should  be  defined  without  taking  into  account 
the  underlying  physical  layer,  which  is  seen  by  MAC  as  a 


black  box  offering  the  service  of  transferring  bits  in  the  form  of 
signals  appropriate  for  channel.  From  this  view,  there  are  some 
MAC  protocols  appeared  for  UWB  communication  systems.  In 
European  funded  project  U.C.A.N  (Ultra  wideband  Concepts 
for  Ad-hoc  Networks),  a  TDMA-based  MAC  protocol  is 
proposed.  This  MAC  protocol  is  an  adaptation  for  UWB 
from  IEEE  802. 1 5.3  [2]  draft  standard  for  narrow-band  wireless 
personal  area  network  (WPAN).  In  [3],  they  proposed  a  single 
transceiver  approach  for  UWB  and  a  companion  MAC  layer 
based  on  busy  tone  multiple  access  (BTMA).  BTMA  reduces 
the  time  and  the  energy  spent  on  collision  as  compared  to 
handshaking  protocols. 

However,  the  design  of  a  truely  efficient  MAC  protocol 
for  UWB  systems  should  investigate  some  possible  MAC 
enhancements  that  will  take  into  account  the  inherent  ad¬ 
vantages  of  UWB  technology.  In  [4],  they  proposed  scheme 
providing  distributed  medium  access  through  pulse  sense, 
which  is  similar  to  carrier  sense  in  narrowband  systems. 
Moreover,  UWB  is  flexible  in  the  reconfiguration  process  of 
data  transmission  rate.  Thus,  opposite  to  mutual  exclusion 
MAC  protocols,  another  approach  for  MAC  protocol  design  is 
issued.  This  kind  of  MAC  protocol  for  UWB  systems  allows 
simultaneous  transmission  and  adapts  to  multiuser  interfer¬ 
ence.  F.  Cuomo  et  al.  [5]  outlined  key  issues  to  design  a  multi 
access  scheme  based  on  UWB.  They  selected  a  distributed 
mechanism  to  handle  radio  resource  sharing,  and  presented  a 
general  framework  of  radio  resource  sharing  to  UWB  wireless 
ad  hoc  networks. 

In  this  paper,  starting  from  the  inherent  characteristics  of 
UWB  technology  and  cross-layer  design  approach,  we  propose 
a  MAC  protocol:  throughput-maximized  MAC  protocol  (TM- 
MAC).  We  combine  MAC  layer  and  physical  layer  together 
to  optimize  network  throughput.  At  MAC  layer,  network  is 
re-divided  into  subsets  based  on  TM-MAC.  In  each  subset, 
communication  pairs  can  make  communication  simultaneously 
to  enhance  throughput  and  to  exploit  as  fast  as  possible 
data  transmission  rate  for  reliable  communication.  In  subset 
formation,  we  propose  a  general  analytical  framework  that 
captures  the  unique  characteristics  of  shared  wireless  channel 
and  allows  to  model  a  large  class  of  systemwide  throughput 
maximization  issue  via  the  specification  of  per-link  utilization 


functions.  At  physical  layer,  we  analyze  the  connection  among 
the  theoretical  maximum  channel  capacity,  the  achievable 
maximum  channel  capacity  and  data  transmission  rate.  For 
multiuser  interference,  we  establish  a  model  to  adaptively 
adjust  the  maximum  data  transmission  rate  to  ensure  a  satisfied 
signal  to  noise  ratio  (SNR)  at  the  receiver  side. 

The  remainder  of  this  paper  is  organized  as  follows.  In 
Section  II,  we  summarize  motivations  for  our  work.  We 
analyze  the  impact  of  simultaneous  transmission  on  system’s 
transmission  capability  in  Section  III.  Section  IV  describes 
our  TM-MAC  algorithm.  Simulation  results  are  given  in 
Section  VI.  Section  VI  concludes  this  paper. 

II.  Our  Motivation 

A.  Energy  Constraint 

One  of  the  biggest  challenges  for  designers  of  WSNs 
is  to  develop  systems  that  will  run  unattended  for  years. 
This  calls  for  not  only  robust  hardware  and  software,  but 
also  lasting  energy  resources.  However,  current  generation  of 
sensor  nodes  is  battery  powered,  whose  available  energy  is 
limited,  and  replacing  or  recharging  batteries,  in  many  cases, 
may  be  impractical  or  uneconomical.  Thus,  protocols  and 
applications  designed  for  WSNs  should  be  highly  efficient 
and  optimized  in  terms  of  energy.  For  WSNs,  communication, 
not  only  transmitting,  but  also  receiving,  or  merely  scanning 
the  channel  for  communication,  can  use  up  to  half  energy[6]. 
Therefore,  recently,  some  researchers  have  begun  studying  the 
problem  of  reducing  power  consumption  on  wireless  interface. 

Considering  that  UWB  is  a  low  radiated  power  communi¬ 
cation  scheme.  The  research  in  [7]  points  out  that  a  bit  rate 
of  lOOKbps  over  5  meters  with  no  more  than  1  mW  power 
consumption.  UWB  is  the  idlest  choice  in  terms  of  energy  ef¬ 
ficiency  for  energy-constraint  WSNs.  However,  seldom  WSNs 
take  advantage  of  properties  of  UWB  technology  because  of 
the  lack  of  an  efficient  MAC  technology. 

Related  with  energy  constraint  problem  for  WSNs,  we  have 
done  some  work  in  [8]  and  [9].  Our  proposed  energy-efficient 
MAC  protocols:  A-MAC  and  ASCEMAC,  not  only  reserve 
energy  to  extend  network  lifetime  but  also  own  the  capability 
to  solve  accumulative  clock  drift  problem  without  network 
synchronization.  While,  both  of  them  only  suit  narrow-band 
communication  systems  well.  In  this  research,  we  extend  our 
previous  work  into  UWB  communication  systems. 

B.  Physical  Layer  Property 

In  UWB  communication  systems,  even  though  bandwidth 
is  finite,  the  bandwidth  is  at  least  500 MHz  or  the  transmitted 
signal  has  a  lOdB  bandwidth  larger  than  20  percent  of 
its  center  frequency  based  on  FCC  standardjl].  Compared 
to  narrowband  systems,  information-theoretic  results  in  [10] 
and  [11]  show  that  a  Shannoncapacity  of  a  multipath  fading 
AWGN  wideband  channel  is  a  linear  function  of  SNR  as 
shown  in  (1). 

C  =  £  x  SNR  (1) 

Moreover,  for  UWB  systems,  data  transmission  rate  R  can 
be  formulated  using  (2).  Here  Ns  is  the  number  of  pulses 


transmitted  at  the  pulse  repetition  time,  Nh  is  the  number  of 
frames  (pulses)  per  information  bit  and  Tc  is  the  bin  duration. 


Note  that,  UWB  is  flexible  in  the  reconfiguration  process  of 
data  transmission  rate  due  to  the  availability  of  a  number  of 
transmission  parameters,  such  as  Ns  Nh  and  Tc,  which  can 
be  tuned  to  better  match  the  requirements  of  a  data  flow. 
Therefore,  for  UWB  systems,  a  consequence  of  rate  adaptation 
is  that  an  arbitrary  level  of  interference  is  possible.  That  is, 
for  a  given  level  of  interference  at  the  receiver  side,  a  sender 
can  tune  its  rate  by  adjusting  the  code,  in  order  to  achieve  a 
desired  bit  error  rate  (BER). 

III.  Theoretic  analysis  on  throughput 

A.  Network  Model 

The  network  model  assumed  in  this  paper  is  that  of  “pi- 
conet”  or  clustered  architecture.  We  assume  that  we  can  set 
up  this  hierarchical  topology  for  a  randomly  deployed  network 
through  certain  existing  methods,  such  as  energy-efficient  self¬ 
organization  (ESO)  [12]  algorithm.  In  each  piconet,  there  is  a 
cluster  head  and  a  multitude  of  normal  nodes.  Communication 
range  determines  the  radius  of  piconet.  Two  nodes,  which  are 
only  one  hop  apart,  are  neighbors  and  can  communicate  with 
each  other  directly.  Our  proposed  algorithm  is  responsible  for 
controlling  wireless  medium  (or  the  common  channel)  access. 

B.  Physical  Layer  Model 

In  this  paper,  we  utilize  binary  PPM-TH-UWB  to  generate 
the  transmitted  signal  a :{t).  That  is,  we  utilize  pulse  position 
modulation  (PPM)  to  implement  modulation  and  time  hopping 
(TH)  to  shape  the  spectrum  of  the  generated  signal. 

OO 

x{t)  =  Ptt  ~  iTf  ~  ciTc  ~  aJ£)  (3) 

j—  —  oo 

where  p(t)  denotes  a  unit-energy  pulse.  Tj  is  the  frame 
duration.  Cj  is  the  dedicated  pseudorandom  code.  a,j  is  the 
binary  bit  of  transmitted  symbols,  e  is  transmission  time  delay 
for  each  pulse. 

We  consider  the  multi-path-affected  UWB  radio  channel 
for  signal  propagation.  In  this  paper,  we  use  IEEE  802.15.3a 
channel  model[13].  The  signal  r(t)  at  the  receiver  side  can  be 
expressed  as  following: 

L(t) 

r(t)  =  aj(t)x(t  -  Tj)  +  u>{t)  +  n(t)  (4) 

3  = 1 

where  ay(t)  and  Tj(t)  are  the  channel  gain  and  the  delay 
measured  at  time  t  for  the  jth  path.  L(t)  is  the  number  of 
path  observed  at  time  t.  ui(t)  is  the  multiuser  interference. 
n{t)  is  AWGN  noise. 

Considering  a  network  with  N  communication  pairs  com¬ 
posed  of  UWB  terminals,  each  pair  consists  of  one  transmitter, 


one  receiver  and  uses  one  pseuodorandom  code.  If  N  links  are 
active,  then  the  signal  to  interference  and  noise  ratio  (SINK) 
at  the  ith  link’s  receiver  at  time  t  is  formed  as  following[5]: 

SINRi  = - -  (5) 

Ri{vi  +  Tf<J2  Efc=l,Mi  Pk9ki} 

where  Ri  is  the  data  transmission  rate  of  the  ith  link;  P*  is 
the  average  power  emitted  by  the  ith  link’s  transmitter;  is 
the  path  gain  from  the  ith  link’s  transmitter  to  the  jth  link’s 
receiver;  rn  is  the  background  noise  energy  plus  interference 
from  other  non-UWB  systems;  a2  is  an  adimensional  param¬ 
eter  depending  on  the  shape  of  monocycle. 

In  this  case,  the  total  multi-path  gain  g,  which  measures  the 
total  amount  of  energy  collected  over  L  pulses,  is  determined 


as  following:  g  =  Ei=i 


Note  that  g  is  related 


with  the  attenuation  suffered  by  the  transmitted  pulses  during 
propagation.  In  multi-path  environments,  g  decreases  with 
distance  according  to  the  path-loss  model  as  following[14]: 


9  d? 


where  go  is  the  reference  value  for  power  gain  evaluated  at 
do  =  lm  and  P  is  the  exponent  of  power  or  energy  attenuation 
law. 

C.  The  Impact  of  Simultaneous  Transmission  on  Network 
Throughput 

Within  a  network,  piconets  can  be  treated  as  independent 
with  each  other,  since  the  distance  among  piconets  is  far 
enough  to  permit  us  ignore  the  interference  among  them 
(piconet  formation  scheme  can  ensure  this  assumption  to  be 
held).  Therefore,  the  analysis  on  network  throughput  can  be 
simplified  into  the  independent  analysis  on  the  throughput 
for  individual  piconets.  Then  combining  all  results  together 
linearly,  we  can  obtain  the  impact  of  simultaneous  trans¬ 
mission  on  network  throughput.  Thus,  in  the  following  part, 
the  discussion  mainly  focuses  on  the  impact  of  simultaneous 
transmission  on  throughput  within  a  piconet. 

In  this  paper,  the  theoretic  maximum  channel  capacity 
Ct-max  is  defined  as  the  largest  channel  capacity  implied 
by  channel  situation,  and  the  achievable  maximum  channel 
capacity  Ca-max  is  defined  as  the  largest  channel  capacity 
which  is  acquired  through  a  data  transmission  rate  which  can 
ensure  reliable  communication.  From  (1)  and  (5),  note  that 
C  oc  ^  and  we  can  derivate  the  specific  data  transmission 
rate  for  Ca-max,  noted  as  Ra-max,  to  achieve  our  goal  - 
trying  to  not  only  enhance  data  transmission  rate  to  shorten  the 
transmission  latency  but  also  ensure  reliable  communication 
and  enhance  network  throughput.  The  value  of  Ra-max  is 
calculated  as  following: 

d  _  /  ip9  m 

-n-a-max  -  W  N  V ' ) 

V  V  +  T}^  Efe=l,fc#i  Pk9k 

In  a  piconet  where  there  are  N  pairs  of  communica¬ 
tion  terminals  making  communication  simultaneously  at  rate 


Ra-max, i[i  =  1,  •  •  •  ,N).  In  this  case,  the  achieved  channel 
capacity  for  each  pair  can  also  be  determined  by  (7).  Thus, 
the  acquired  throughput  ( THput )  of  this  piconet  at  time  t  is 
calculated  as  following: 

rr  tt  \  '  /  £Pi9ii 

i Rput - 2^ \  -  —  2v-jv-i — w 
i=l  V  Vi  +  Pfa  Efc=l,fc^i  pk9ki 

For  this  existed  piconet,  if  we  add  m  more  communication 
pairs.  What  is  the  influence  on  the  throughput?  Through 
analyzing  the  change  of  throughput  when  adding  different 
number  of  communication  pairs  or  picking  up  same  number 
of  communication  pairs  but  located  at  different  positions,  we 
obtain  criteria  for  forming  simultaneous  transmission  subset 
to  improve  throughput. 

Without  making  any  influence  on  analysis,  we  let  Ui  = 
■Pfcfffci  and  the  newly  added  communication 
pairs  are  pair  N  +  1  to  pair  N  +  m.  The  impact  of  adding 
more  simultaneous  transmissions  on  throughput  is  evaluated 
by  the  change  of  it,  noted  as  A THput  and  calculated  using 
(9). 


TH’put 


j  _ tpi9ii 

ni  +  Ui  +  Tfo2  Ylk=N+ 


i=N+ 1  \j  Vi  +  TjC2  pk9ki 

-  |/1%  ® 

We  also  obtain  the  relationship  between  the  new  data  trans¬ 
mission  rate  R'a-maXti  and  the  original  data  transmission  rate 
Ra-max, i  for  existed  communication  pair  *(*  =  !,•••  ,  N)  as 


rp  2  N  +  m 

following:  - h  — — ■■pp*?11'—.  Note 

that,  adding  some  communication  pairs  into  an  existed  network 

will  decrease  Ra-ma x,  since  — - fp^~~ -  1S  positive. 

While,  how  much  Ra-max  will  be  degraded  is  determined  by 
the  sum  interference  coming  from  all  simultaneous  users. 

From  (9),  we  found  that,  when  adding  communication 
pairs,  the  change  of  throughput  of  a  piconet  is  related  with 
the  negative  influence  degrading  Ra-max  for  existed  com¬ 
munication  pairs  and  the  positive  influence  for  permitting 
more  communication  pairs  work  concurrently.  Thus,  from 
network  perspective,  letting  more  communication  pairs  work 
concurrently  does  not  definitely  mean  enhancing/degrading 
throughput.  There  is  a  watershed  for  it.  That  is,  when  the 
negative  influence  is  smaller  than  or  equals  to  the  positive 
influence  of  adding  some  communication  pairs  into  an  existed 
network,  system  performance  in  term  of  throughput  will  be 
improved,  or  at  least  not  be  degraded.  That  is  A THput  >  0. 

First,  we  consider  the  N  =  1  and  m  =  1  scenario  as 
the  our  analysis  basis.  We  assume  that  the  distance  between 
transmitter  and  receiver  is  same  for  each  communication  pair 
(i.e.,  da  =  d ),  and  each  communication  pair  can  use  same 


L.  Note 


power  to  make  transmission.  Moreover,  the  noise  floor  for 
communication  is  fixed.  Thus  (9)  is  simplied  into  (10). 


AT  Hput  = 
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(10) 


In  this  case,  A THput  is  a  function  of  d\2  and  d2\.  Here, 
di2  denotes  the  distance  between  the  original  existed  commu¬ 
nication  pair’s  transmitter  and  the  added  communication  pair’s 
receiver,  and  d2i  stands  of  the  distance  between  the  added 
communication  pair’s  transmitter  and  the  original  existed 
communication  pair’s  receiver. 

Note  that,  to  ensure  the  throughput  to  be  enhanced  when 
adding  one  or  more  communication  pairs,  the  shortest  dis¬ 
tance  for  d\2  and  d2\  should,  at  least,  equal  to  a  threshold 
dmin,  which  is  related  with  the  choice  of  other  parameters, 
such  as  transmission  power  P,  environment  noise  strength  77, 
environment  exponential  /3,  symbol  time  T/,  monocycle  shape 
a2,  power  gain  go  and  communication  range  of  node  drag,  but 
the  distance  between  the  transmitter  and  the  receiver  of  the 
originally  existed  communication  pair.  That  is: 


IV.  Throughput-Maximized  MAC  Protocol 
(TM-MAC)  Description 

We  ever  proposed  an  energy-efficient  MAC  protocol: 
ASCEMAC[8]  for  WSNs.  ASCEMAC  forces  nodes  power  on 
and  off  their  batteries  alternately  to  implement  communication, 
as  well  as  to  reduce  energy  consumption  on  collision  and 
idle  listening  to  extend  network  lifetime.  However,  for  UWB 
communication  systems,  TDMA-based  and  contention-based 
mutual  exclusion  medium  access  control  schemes  are  not  the 
best  choice  any  more.  We  propose  our  algorithm:  throughput- 
maximized  MAC  protocol  specially  for  UWB  communication 
systems. 

Inheriting  the  advantages  of  ASCEMAC,  TM-MAC  also 
divides  the  system  time  into  four  phases:  TRFR-Thase, 
Schedule-flroadcast-Phase,  On-Phase  and  Off-Phase.  Off- 
Phase  is  preserved  for  all  nodes  to  power  off  their  radios  and 
On-Phase  is  preserved  for  to  power  on  their  radios  to  carry 
on  communication.  TRFR  message,  PRFR-Phase  duration  de¬ 
sign,  matching  schedule  establishment,  maintenance,  schedule 
interval  design  and  time-slot  allocation  mechanisms  for  TM- 
MAC  are  similar  to  ASCEMAC’s.  But,  in  TM-MAC,  during 
On-Phase  the  further  divided  super-time-slots  are  individually 
occupied  by  subsets,  a  group  of  communication  pairs.  TM- 
MAC,  based  on  the  network  topology,  is  responsible  for 
further  dividing  a  network  into  a  set  of  subsets  in  which 


communication  pairs  can  make  communication  simultaneously 
to  maximize  throughput,  so  that  to  improve  better  energy 
efficiency  and  spectrum  utilization. 

A.  Modelling  Network 

The  network  is  represented  as  a  directed  graph  G  =  ( V ,  E ). 
V  is  the  set  of  nodes  in  a  piconet.  e  =  (u,v)  is  an  edge 
in  E  iff  nodes  u  and  v  are  transmitter  and  receiver  of  a 
communication  pair.  Fig.  1(a)  shows  an  example,  in  which 
nodes  A ,  C,  E,  G,  I,  K  are  transmitters,  B,  D,  F,  H,  J,  L  are 
receivers.  In  our  algorithm  the  interference  caused  by  newly 
added  communication  pairs  is  untolerable  when  the  created 
throughput  is  smaller  than  the  original  one.  Otherwise,  it 
is  tolerable.  The  untolerable  interference  coming  from  other 
communication  pairs  are  denoted  by  dotted  lines.  According 
to  the  network  topology,  we  utilize  the  conclusion  acquired 
in  (11)  to  determine  the  untolerable  interference  from  other 
communication  pairs. 


Fig.  1.  (a)  Network  graph  G,  (b)  Interference  tolerable  graph  G'  and 

(c)Potential  Group  formation  graph  G" 


B.  Forming  Interference-Tolerated  Relationship 

We  consider  all  communication  pairs  in  a  piconet.  We 
generate  the  interference  tolerable  graph  G'  =  {V' ,  E'). 
V'  C  E,  i.e.,  each  node  in  G'  is  a  communication  pair  in 
G.  e'  =  (u',v')  is  an  edge  in  E'  iff  the  achieved  throughput 
is  higher  than  the  throughput  generated  by  those  two  terminals 
separately.  Fig.  1(b)  presents  the  inference  tolerable  graph  for 
the  graph  in  Fig.  1(a). 

C.  Forming  Potential  Subset 

We  generate  the  potential  subset  forming  graph  G" .  G"  = 
(Vi,  V2,  E")  is  a  bit-partite  graph  such  that  V\  =  V',  and  each 
node  in  V2  presents  all  cliques  and  sub-cliques  in  G' ,  e"  = 
( u ",  v")  is  an  edge  in  E"  iff  u"  e  V) ,  v"  G  V2,  and  u"  belongs 
to  one  of  cliques  in  G'  represented  by  v" .  Fig.  1(c)  represents 
the  potential  group  formation  graph  for  the  inference  tolerable 
graph  shown  in  Fig.  1(b). 

Each  clique  in  G'  represents  a  potential  subset  of  commu¬ 
nication  pairs  which  can  make  simultaneous  communication 
to  enhance  throughput.  We  represent  each  node  in  V\  as  an 
one-off  source  which  is  granted  and  must  to  be  used  once 
in  a  super-timeslot.  Then,  the  subset  formation  in  a  piconet 
represents  the  optimal  classification  for  all  nodes  within  a 
piconet,  and  a  node  in  V2  is  permitted  to  occupy  the  channel 
mutually  excluded  if  and  only  if  it  can  make  contribute  to 
achieve  higher  throughput  and  one-off  source  is  still  available. 


Let  hj  be  an  indicator  function  such  that  Uj  =  1  if  the 
node  j  €  V2  is  allocated  with  channel  by  node  i  €  Vi, 
and  kj  =  0  otherwise.  Let  TH'vut  be  the  sub-throughput 
generated  by  node  j.  Then  the  channel  allocation  problem  can 
be  represented  as  a  set  of  the  following  linear  constraints. 

vz,  £/0-  =  i 

3 

Vi  rj=TH'putJxJ2Ia  (12) 

i 

Where  rj  is  the  sub-throughput  generated  by  node  j  in 
V2,  Note  that  this  set  of  constraints  captures  the  location- 
dependent  interference  on  piconet  throughput  characteristics 
of  UWB  communication  systems. 

D.  Maximizing  Piconet  Throughput 

Consider  a  utility  function  U(r)  for  a  throughput  r  is 
defined  as: 

U(r)  =  (13) 

r 

where  Ltr  stands  of  number  of  information  bits  waiting  for 
transmission.  The  unit  for  r  is  bits /sec. 

Since  our  goal  for  subset  optimization  is  to  improve  pi¬ 
conet  throughput  as  much  as  possible,  maximizing  piconet 
throughput  problem  can  be  modelled  by  the  following  system 
of  equations: 

Minimize  Em*) 

3 

subject  to 

Vi,  £/y  =  l 

3 

Vj.Vi,  rj  =  TH'puUj  x  (14) 

i 

V.  Performance  Evaluation 

We  do  a  set  of  simulations  to  evaluate  the  performance 
of  our  algorithm:  TM-MAC.  A  network  with  amount  of 
communication  pairs  is  set  up.  Those  communication  pairs 
are  deployed  randomly  in  an  area  of  50  x  50m2  and  have 
no  mobility.  This  network  can  be  treated  as  one  piconet  in 
a  large-scale  system.  Parameters  for  simulations  are  given  in 
Table  I. 

We  deploy  5  to  40  communication  pairs  separately  in  the 
same  region.  Then,  we  form  simultaneous  transmission  subsets 
using  our  TM-MAC.  We  observe  the  number  of  generated 
subsets.  We  run  Monte  Carlo  simulations  and  make  average 
operation  on  results  to  remove  the  randomicity  of  simulation 
results.  The  results  are  shown  in  Table  II.  We  also  observe 
the  chance  for  each  communication  pair  being  classified  into 
different  subset  (See  Table  III). 

Note  that,  from  Table  II  and  Table  III,  three  to  four  percent 
of  communication  pairs  being  classified  into  1-pair,  2-pair 
or  3-pair  subset.  Even  with  node  density  increase,  there  is 


TABLE  I 

VALUE  FOR  PARAMETERS  USED  BY  SIMULATIONS. 


Parameter 

Typical  Value 

Parameter 

Typical  Value 

T, 

100  ns 

<7" 

1.9966  x  10-J 

V 

2.568  X  10-'21  V2s 

P 

1  pW 

d 

1  m 

£ 

log? 

So 

7.9433  x  10_ti 

drag 

20  m 

more  chance  to  form  4-pair  subset,  i.e.,  there  is  10%  higher 
probability  for  40  communication  pairs  scenario  than  for  5 
communication  pairs  scenario. 

Within  a  piconet,  we  make  simulation  to  check  the  actual 
throughput  achieved  by  different  subsets.  Under  various  node 
density,  we  plot  throughput  versus  various  types  of  subset 
(See  Fig.  2(a)).  Note  that,  subset  within  which  there  are 
more  communication  pairs  making  communication  simulta¬ 
neously  acquires  higher  throughput.  In  40  communication 
pairs  scenario,  the  throughput  achieved  by  4-pair  subset  is 
1 19309kbps,  which  is  around  two  times  of  the  throughput 
achieved  by  1-pair  subset.  Moreover,  with  the  node  density 
increase,  the  largest  throughput  which  can  be  achieved  is 
decreased  since  the  interference  coming  from  other  users  is 
increased. 


Fig.  2.  (a)  Throughput  Achieved  per  Subset  and  (b)  Average  Throughput 

Achieved 

We  compare  our  TM-MAC  against  802.15.3a,  which  uses  a 
mutual  exclusion  scheme  to  implement  medium  access  control 
-  TDM  A  scheme.  We  check  the  throughput  achieved,  trans¬ 
mission  time  needed  for  certain  traffic  load  and  longest  latency 
for  data  packet  for  our  TM-MAC  and  802.15.3a  (See  Fig.  2(b), 
Fig.  3(a)  and  Fig.  3(b)).  Note  that,  our  TM-MAC  can  achieve 
higher  throughput  than  802.15.3a  around  5.97%  to  25.358%. 
Given  same  amount  of  traffic  to  networks  which  run  TM-MAC 
and  802.15.3a  separately,  the  transmission  time  needed  for 
TM-MAC  is  shorter  than  the  one  for  80.215.3a.  The  reduced 
ratio  for  various  nodes  density  from  40  communication  pairs  to 
5  communication  pairs  locates  within  the  range  from  10.358% 
to  32.18%.  Since  802.15.3a  uses  mutual  excluded  scheme 
for  medium  access  control,  the  communication  for  various 
communication  pairs  is  carried  out  serially.  While,  for  TM- 
MAC,  some  communication  pairs  can  make  communication 
simultaneously.  The  longest  latency  for  TM-MAC  is  shorter 
than  802.15.3a.  The  decreased  ratio  is  from  18.554%  to 
65.869%. 


Fig.  3.  (a)  Transmission  Duration  Needed  and  (b)  Longest  Latency  for  Data 
Packet 


VI.  Conclusion 

In  this  paper,  we  propose  a  MAC  protocol:  throughput- 
maximized  MAC  protocol  (TM-MAC),  based  on  the  char¬ 
acteristics  of  ultra  wideband  (UWB)  technology.  In  UWB 
communication  systems,  the  transmission  parameters  are  tun¬ 
able  to  match  the  requirements  of  data  flow.  In  TM-MAC, 
we  implement  concurrent  multiuser  access  instead  of  mutual 
exclusion  method,  such  as  TDMA  and  random  access.  For 
multiuser  interference,  we  establish  a  model  to  adaptively 
adjust  the  data  transmission  rate  to  ensure  satisfied  signal 
to  noise  ratio  (SNR)  at  receiver  side.  We  also  analyze  the 
relationship  among  the  theoretical  maximum  channel  capacity, 
the  achievable  maximum  channel  capacity  and  the  maximum 
data  transmission  rate.  According  to  the  network  topology, 
TM-MAC  re-divides  network  into  subsets,  in  which  com¬ 
munication  pairs  can  make  communication  simultaneously 
to  enhance  throughput  and  to  exploit  as  fast  as  possible 
data  transmission  rate  for  reliable  communication.  For  subset 
formation,  we  propose  a  general  analytical  framework,  which 
captures  the  unique  characteristics  of  shared  wireless  channel 
and  allows  to  model  a  large  class  of  systemwide  throughput 
maximization  issue  via  the  specification  of  per-link  utilization 
functions. 

Simulation  results  demonstrate  that  TM-MAC  can  imple¬ 
ment  throughput  maximization  to  shorten  latency  and  to 
enhance  network  processing  capability. 
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TABLE  II 

NUMBER  OF  DIFFERENT  KIND  OF  SUBSET  GENERATED  WITH  VARIOUS 
NODE  DENSITY.  1-PAIR  SUBSET,  2-PAIR  SUBSET,  3-PAIR  SUBSET  AND 
4-PAIR  SUBSET  SEPARATELY  DENOTE  THE  SUBSET  IN  WHICH  THERE  IS 
ONE  COMMUNICATION  PAIR,  TWO  COMMUNICATION  PAIRS,  THREE 
COMMUNICATION  PAIRS  OR  FOUR  COMMUNICATION  PAIRS. 


5  pairs 

10  pairs 

15  pairs 

20  pairs 

30  pairs 

40  pairs 

1  -pair  subset 

1.648 

2.964 

4.198 

5.782 

8.398 

10.635 

2-pairs  subset 

0.824 

1.368 

2.175 

3.058 

5.081 

7.293 

3-pairs  subset 

0.464 

0.62 

0.856 

1.134 

1.908 

2.677 

4-pairs  subset 

0.078 

0.61 

0.771 

1.105 

1.429 

1.687 

TABLE  III 

PERCENTAGE  OF  COMMUNICATION  PAIR  BEING  CLASSIFIED  INTO  WITHIN 
DIFFERENT  KIND  OF  SUBSET  FOR  VARIOUS  NODE  DENSITY 


Vo 

5  pairs 

10  pairs 

15  pairs 

20  pairs 

30  pairs 

40  pairs 

1  -pair  subset 

32.96 

29.64 

27.987 

28.91 

27.993 

26.587 

2-pair  subset 

32.96 

27.36 

29 

30.58 

33.873 

36.465 

3-pair  subset 
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Abstract — Query  processing  has  been  studied  extensively  in 
traditional  database  systems.  But  few  of  existed  methods  can  be 
directly  applied  to  wireless  sensor  database  systems  due  to  their 
characteristics,  such  as  decentralized  nature,  limited  computa¬ 
tional  power,  imperfect  information  recorded,  and  energy  scarcity 
of  individual  sensor  nodes.  In  this  paper,  we  extend  our  previous 
work:  quality-guaranteed  and  energy-efficient  algorithm  (QGEE) 
for  wireless  sensor  database  systems.  We  introduce  radius  of 
covering  disk  from  point  spread  function  (PSF)  aspect  and  sample 
size  for  query  quality  and  energy  consumption  control.  PSF 
introduces  ambiguity  into  query  answers,  since  the  sensitivity 
of  nodes  is  nonuniform  within  monitoring  region.  Sample  size 
determination  refers  to  the  process  of  determining  exactly  how 
many  samples  should  be  measured  in  order  that  the  sampling 
distribution  of  estimators  meets  users’  pre-specified  target  pre¬ 
cision.  In  this  paper,  we  formulate  the  criteria  to  determine  the 
optimum  radius  and  sample  size  according  to  users’  requirements 
on  query  answers.  Simulation  results  demonstrate  that  the  impact 
of  sample  size  and  monitoring  coverage  on  query  answers  in 
terms  of  root  mean  square  error  (RMSE). 

I.  Introduction 

Recent  developments  in  integrated  circuit  technology  have 
allowed  the  construction  of  low-cost  sensor  nodes,  which 
are  generally  equipped  with  sensing  capabilities,  wireless 
communication  and  limited  power  supply,  CPU  and  memory. 
These  devices  are  expected  to  be  embedded  into  environment 
to  create  very  dense  networks[l].  High  level  tasks,  such  as 
monitoring  specific  events,  are  accomplished  by  cooperation 
of  multiple  nodes  to  collect  and  process  information.  Wireless 
sensor  networks  (WSNs),  which  operate  in  unattended  mode, 
represent  an  emerging  type  of  network.  WSNs  are  intended 
for  a  broad  range  of  environmental  sensing  applications 
from  weather  data  collection  to  vehicle  tracking  and  habitat 
monitoring[2][3]. 

The  goal  of  monitoring  through  sensor  nodes  is  to  infer 
information  about  objects  from  measurements  made  from 
remote  locations.  Since  inference  processes  are  always  less 
than  perfect,  there  is  an  element  of  uncertainty  regarding 
answers.  When  viewed  from  this  perspective,  the  problem 
of  uncertainty,  which  stands  for  the  quality  of  query  answer, 
is  central  to  monitoring  applications.  Thus,  to  build  useful 
information  systems,  it  is  necessary  to  learn  how  to  represent 
and  reason  with  imperfect  information.  As  a  result,  Motro[4] 


is  interested  in  how  imperfect  information  may  be  represented 
in  a  database  system.  Turtle  and  Corft[5]  in  their  discussion 
of  uncertainty  in  information  retrieval  systems  also  argue  that 
the  issue  of  imperfect  information  cannot  be  ignored. 

Uncertain  information  is  typically  handled  by  attaching  a 
number,  which  represents  a  subjective  measure  of  an  uncertain 
element  according  to  some  observer,  to  that  element.  The  way 
in  which  the  number  is  manipulated  depends  upon  the  theory 
that  underlies  the  number.  There  are  possibilistic  databases[6] 
and  probabilistic  databases[7][8][9].  Moreover,  probabilistic 
approach  has  started  to  be  used  by  WSNs  to  process  query 
with  limited  information[10]. 

Most  of  algorithms  for  determining  query  processing  strate¬ 
gies  in  WSNs  are  static  in  nature.  In  [11],  Bodorik  proposed 
aborted  join  last  (AJL)  method  to  substitute  static  mechanism 
with  adaptive  one,  which  owns  low  overhead  delay  to  decide 
when  to  correct  a  strategy.  In  AJL,  the  decision  for  correction 
is  computationally  simple  and,  moreover,  a  corrective  strategy 
is  already  exist  when  it  is  decided  to  correct.  Acquisitional 
query  processing  (ACQP)[12],  compared  with  typical  meth¬ 
ods,  focuses  on  betaking  the  significant  new  query  processing 
opportunity  that  arises  in  WSNs:  smart  sensor  nodes  have  the 
capability  to  control  over  where,  when,  and  how  often  data  is 
physically  acquired  (i.e.,  sampled)  and  is  delivered  to  query 
processing  operators. 

Considering  energy  constraint  issues,  some  energy  efficient 
solutions  are  proposed.  Query  processing  based  on  random 
walk  technique[13]  is  an  alternative  scheme  to  implement 
failure  recovery  in  dynamic  environment.  The  robustness  of 
this  approach  under  dynamic  situation  follows  the  simplicity 
of  processing,  which  only  requires  the  connectivity  of  moving 
neighbors.  In-network  query  processing  is  critical  for  reducing 
network  traffic  when  accessing  and  manipulating  data.  It 
requires  to  place  not  only  a  tree  of  query  processing  operators 
such  as  filters  and  aggregations  but  also  correlations  onto 
nodes  in  order  to  minimize  the  amount  of  data  transmitted 
over  the  network.  In  [14],  an  adaptive  and  decentralized 
algorithm  is  proposed.  This  algorithm  progressively  refines  the 
placement  of  query  processing  operators  by  walking  through 
neighbor  nodes.  Thus,  an  initial  arbitrary  placement  of  query 
processing  operators  can  be  progressively  refined  toward  an 


optimal  placement. 

Existing  query  processing  systems  for  WSNs,  including 
Directed  Diffusion[15],  TinyDB[12]  and  Cougar[16],  provide 
high-level  interface  that  allows  users  to  collect  and  process 
such  continuous  streams.  Note  that  they  are  especially  at¬ 
tractive  as  ways  to  efficiently  implement  monitoring  applica¬ 
tions  without  forcing  users  to  write  complex,  low-level  code 
for  managing  multihop  network  topologies  or  for  acquiring 
samples  from  sensor  nodes.  TinyDB,  Directed  Diffusion  and 
Cougar  are  relatively  mature  research  prototypes  that  give 
some  ideas  on  how  future  query  processing  system  will 
function  for  WSNs. 

Our  previous  work:  quality-guaranteed  and  energy-efficient 
algorithm  (QGEE)  for  wireless  sensor  database  systems[17], 
considers  the  energy  constraint  problem  and  quality  require¬ 
ment  from  active  nodes  election,  information  collection  and 
query  answer  expression.  QGEE  utilizes  in-network  query 
processing  method  to  task  wireless  sensor  database  systems 
through  declarative  queries  and  uses  confidence  interval  strat¬ 
egy  to  determine  the  closeness  of  a  query  answer  to  the 
true  value.  Through  further  study,  we  extend  QGEE  in  this 
paper.  We  introduce  radius  of  covering  disk  from  point  spread 
function  (PSF)  aspect  and  sample  size  for  query  quality  and 
energy  consumption  control.  PSF  introduces  ambiguity  into 
query  answers,  since  the  sensitivity  of  nodes  is  nonuniform 
within  monitoring  region.  Sample  size  determination  refers 
to  the  process  of  determining  exactly  how  many  samples 
should  be  measured  in  order  that  the  sampling  distribution 
of  estimators  meets  users’  prc-specified  target  precision.  In 
this  paper,  we  formulate  the  criteria  to  determine  the  optimum 
radius  and  sample  size  according  to  users’  requirements  on 
query  answers. 

The  remainder  of  this  paper  is  organized  as  follows.  Sec¬ 
tion  II  summarizes  our  previous  work  on  QGEE  algorithm. 
Our  extending  work  is  discussed  in  Section  III.  Simulation 
results  are  given  in  Section  IV.  Section  V  concludes  this  paper. 

II.  Quality-Guaranteed  and  Energy-Efficient 
(QGEE)  Query  Procession  Protocol 

QGEE  employs  an  in-network  query  processing  method  to 
task  networks  through  declarative  queries,  which  is  critical 
for  reducing  network  traffic  when  accessing  and  manipulating 
sensor  data.  In  QGEE  algorithm,  only  a  subset  of  nodes  within 
a  network  will  be  chosen  to  acquire  readings  or  samples 
corresponding  to  the  fields  or  attributes  referenced  in  queries. 
The  goal  of  our  approach  is  to  reduce  interference  coming 
from  measurements  with  extreme  errors  and  to  minimize 
energy  consumption  by  providing  service  that  is  considerably 
necessary  and  sufficient  for  the  needs  of  application.  Moreover, 
according  to  the  analysis  and  classification  on  sources  of 
imperfect  information,  we  employ  probabilistic  method  to 
formulate  the  distribution  of  them  in  terms  of  probability  dis¬ 
tribution  function  (PDF).  Finally  probabilistic  query  answers 
are  acquired  on  uncertain  data.  The  probability  in  a  query 
answer  allows  users  to  place  appropriate  confidence  in  it  as 
opposed  to  having  an  incorrect  answer  or  no  answer  at  all. 


From  four  aspects,  QGEE  implement  quality  control: 

•  Query  Vector  Space  Model  (VSM)  Design  and  Active 
Nodes  Selection:  VSM  is  employed  by  us  to  combine  all 
considering  factors,  such  as  node  location,  measurement 
quality  and  remaining  battery  capacity,  to  select  the  most 
related  nodes  to  participate  query  processing.  When  a 
query  submitted,  the  related  top-end  node  fixes  on  optimal 
locations  for  this  query  and  translates  the  query  into  a 
query  VSM  vector.  Query  VSM  vector  and  information 
on  optimal  locations  will  be  flooded  over  the  whole 
network.  A  query  correlation  is  designed  to  express 
the  correlation  between  each  node  and  a  query.  Query 
correlation  is  a  function  of  query  quality  requirement, 
nodes’  energy,  measurement  quantify  and  location.  The 
criterion  for  active  nodes  choosing  is  the  decision-which 
nodes  are  active  to  respond  queries-is  based  on  their 
query  correlations.  That  is,  nodes  with  highest  query 
correlation  among  their  one-hop  neighbors  are  chosen  to 
participate  in  related  query  processing.  In  QGEE,  active 
nodes  are  chosen  locally  leveraging  cooperations  among 
nodes. 

•  Optimal  Location  Determination:  We  model  the  problem- 
determining  optimal  locations  for  a  query,  as  a  k-partial 
set  cover  problem.  It  is  a  NP  problems  which  requires 
time  that  is  exponential  to  the  problem  size.  We  exploit 
an  approximation  algorithm:  SETCOVER[18],  which  can 
acquire  the  solution  during  polynomial  time,  to  determine 
the  value  of  k  and  the  locations  of  these  k  disks  on  a 
plane.  In  QGEE,  we  choose  centers  of  those  k  disks  as 
our  optimal  locations. 

•  Query  Answer  Expression:  Since  a  statistic  measure¬ 
ment  on  samples  can  rarely,  if  ever,  be  expected  to  be 
exactly  equal  to  a  parameter,  it  is  important  that  an 
estimation  is  accompanied  by  a  statement  which  describes 
the  precision  of  this  estimation.  We  utilize  confidence 
intervals[19]  to  state  both  how  close  the  value  of  a 
statistic  being  likely  to  be  value  of  a  parameter  and  the 
chance  of  being  close. 

•  Information  Collection:  After  active  nodes  are  chosen, 
a  data  centric  routing  algorithm,  EM-GMR[20]  is  em¬ 
ployed,  which  is  a  multipath,  power-aware  and  mobility- 
aware  routing  scheme.  It  is  used  to  establish  route-tree 
from  active  nodes  to  front-end  nodes  for  query  answer 
return.  EM-GMR  uses  reactive  networking  approach,  in 
which  it  finds  a  route  only  when  a  message  is  to  be 
delivered  from  source  to  destination.  EM-GMR  considers 
distance,  remaining  battery  capacity,  and  mobility  of  each 
sensor  node  during  route  path  setting  up.  This  scheme 
could  tremendously  reduce  frame  loss  rate  and  link  failure 
rate  since  mobility  is  considered,  so  that  incompleteness 
information  caused  by  poor  link  quality  can  be  reduced 
at  certain  degree. 

In  energy  consumption  control,  first,  in  the  query  SVM 
design,  node  location  is  included  besides  measurement  quality 
and  remaining  battery  capacity,  since  it  is  directly  related  to 


the  necessary  number  of  active  nodes  to  cover  the  whole 
monitoring  region.  Through  solving  optimal  location  problem, 
we  can  employ  as  few  as  possible  nodes  to  cover  as  large 
as  possible  monitoring  region  in  order  to  carry  out  energy 
reservation  task.  Second,  we  tremendously  reduce  the  frame 
loss  rate  and  link  failure  rate  through  choosing  more  suitable 
nodes  to  set  up  route-tree  for  queries.  With  this  improvement, 
we  can  reduce  energy  consumption  for  route-tree  maintenance 
and  information  retransmission. 

III.  Extending  Work  for  Energy  and  Quality 
Aware  Query  Processing 

As  a  motivation  for  our  work,  we  describe  a  scenario: 

•  A  great  multitude  of  temperature  sensor  nodes  are  ran¬ 
domly  deployed  in  a  region  we  are  interested.  Individual 
sensor  nodes  (or  in  short,  nodes)  is  connected  to  other 
nodes  in  its  vicinity  through  wireless  communication 
interface,  and  it  uses  a  multihop  routing  protocol  to 
communicate  with  nodes  that  are  spatially  distant.  All 
nodes  are  interconnected  to  at  least  one  powered  PC 
(front-end  node)  directly  or  through  intermedial  nodes. 
Front-end  nodes  are  in  charge  of  processing  data,  on  the 
opposite  direction,  disseminating  queries  to  related  nodes. 
Within  this  WSN,  each  node  owns  equal  computing  and 
sensing  capabilities,  but  measurement  quality  for  sensor 
parts  may  be  not  identical. 

A.  Radius  Control  for  k-Partial  Set  Cover  Problem 

Point  spread  function  (PSF)  of  nodes  introduces  ambiguity 
into  query  answers.  Our  temperature  monitoring  application  is 
interested  in  the  temperature  over  a  region  instead  of  one  point 
in  space.  But  considering  operation  feasibility,  cost  and  speed, 
sampling  method  is  widely  used  instead  of  completely  mea¬ 
suring.  In  this  aspect,  another  imperfect  information  source 
is  raised  -  PSF.  PSF  is  caused  by  nonuniform  sensitivity 
within  nodes’  local  space.  In  general,  nodes  exhibit  sensitivity 
variation  similar  to  what  is  shown  in  Fig.  1.  Note  that,  nodes 
are  more  sensitive  to  the  center  of  their  regions  than  toward 
the  edge. 


Fig.  1.  (a)  1 -Dimension  Gaussian  model  of  a  PSF  and  (b)  2-Dimension 

Gaussian  model  of  a  PSF 

Among  various  locations  within  a  disk,  measurements  of 
active  nodes  own  the  lowest  sensitivity/confidence  when  they 
stand  for  the  situation  at  this  disk’s  edge.  This  nature  inspires 
us  to  acquire  the  criterion  to  calculate  suitable  value  for  r. 
That  is,  if  the  sensitivity/confidence  equals  to  or  higher  than  p 


at  the  edge  of  disk,  then  we  can  ensure  that  the  measurements 
of  active  nodes  can  represent  the  situation  within  this  disk  at 
least  with  p  confidence. 

Considering  the  impact  of  PSF  on  the  uncertainty  of  query 
answer,  we  adaptively  adjust  the  radius  r  of  disks  according 
to  users’  quality  requirements  instead  of  fixing  it.  We  assume 
that  PSF  ( g(d ))  of  nodes  in  a  WSN  is  defined  by  (1)  and 
confidence  of  query  answer  is  required  to  be  at  least  p. 

9(d)  =  0) 

(TV  Z7T 

where  d  is  the  distance  between  a  point  and  the  center  on  a 
disk,  o 2  is  the  variance  of  d.  g(d)  has  the  similar  form  as 
shown  in  Fig.  1(a). 

We  derive  (2)  from  (1)  to  determine  r. 

r  =  o\/ln(2Tro2(l  —  p)2)  (2) 

Note  that  r  is  a  function  of  standard  deviation  of  PSF  o  and 
query  quality  requirement  p.  If  we  fixed  cr,  r  will  decrease 
with  increasing  of  p.  That  means,  with  higher  query  quality, 
smaller  disks  are  used  to  search  the  optimum  locations  and 
more  active  nodes  are  needed  for  a  query  processing. 

B.  Sample  Size  Control  for  Data  Sensing 

We  have  chosen  a  set  of  nodes  to  respond  a  query.  However, 
“How  many  measurements  should  be  included  in  one  sam¬ 
ple?”  is  the  question  we  will  answer  in  this  Section.  Sample 
(any  subset  of  a  population)  size  determination  refers  to  the 
process  of  determining  exactly  how  many  samples  should  be 
measured/observed  in  order  that  the  sampling  distribution  of 
estimators  meets  users’  pre-specified  target  precision[21]. 

Since  nodes’  readings  are  subject  to  many  small  and  random 
errors  which  are  caused  by  limitations  of  device’s  hardware 
and  environmental  noise,  uncertainty  is  inherent  regarding  to 
true  values.  Hence  nodes  reading  (a:)  can  be  expressed  as: 

x  =  v  +  em  +  V  (3) 


where  v  is  the  true  value,  em  is  the  measurement  error 
introduced  by  limitations  of  device’s  hardware,  and  rj  is 
the  environmental  noise  which  is  considered  as  added  white 
Gaussian  noise  in  this  paper  and  rj  N{ 0,fQ.  Based 
on  central  limit  theorem[22],  the  probability  distribution  of 
measurement  errors  complies  with  a  normal  distribution.  That 
is,  em  ~  N( 0,  cr2).  Generally,  in  product’s  technical  datasheet, 
manufactories  supply  the  information  on  measurement  error. 
For  example,  the  bias  for  CXM539  is  ilGauss  with  0.95 
confidence[23].  That  means  for  sensor  nodes  CXM593,  o2  = 
0.1302.  For  general  cases,  if  we  know  the  maximum  bias  5 
and  its  confidence  p,  we  can  obtain  the  general  expression  of 
o\.  That  is 
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where  Q(x)  stands  for  Q-Function,  defined  as  Q( x)  = 

C^dy 


Moreover,  em  and  i]  are  independent.  Therefore,  node 
reading  also  complies  with  a  normal  distribution  with  px- mean 
and  ax -standard  deviation  given  in  (5). 


Hx  =  v  and  ax  =  \j  ^ 
Therefore  the  PDF  fx  (x)  of  node  reading  x  is 
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fx(x)  = 


y/M 
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(6) 


Using  sample  mean  to  estimate  the  mean  of  a  random 
variable  is  an  unbiased  estimation.  That  means  the  estimator 
aims  at  the  true  value  or  is  correct  average[24].  In  (5),  the 
mean  px  of  samples  x  equals  to  the  true  value  v.  Moreover, 
we  choose  the  sample  mean  as  our  estimator  for  true  value  v, 
i.e.  vn  =  i  YTj= i  xi-  Thus,  we  do  an  unbiased  estimation  on 
true  value  v.  Here  n  is  the  sample  size.  In  this  case,  the  PDF 
of  vn  is  /p  ( v„ )  with  =  nx  and  a?  =  ±cr%,  as  shown 
in  (7). 
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We  let  Ax  as  the  margin  between  the  estimator  ( vn )  and 
the  true  value  ( v )  to  reflect  the  target  precision  of  queries,  and 
we  specify  our  capability,  ensuring  the  estimation  error  to  be 
within  this  margin,  is  not  to  smaller  than  p.  The  criterion  for 
sample  size  determination  is  simply  stated  as: 


Pr{\vn  -  v\  <  Ax}  >  p  (8) 


Since  we  have  known  the  PDF  of  vn,  the  probability  that 
the  estimation  error  is  not  larger  than  Ax  is 

/\  T 

Pr{\K  ~  »|  <  AX}  -  1  -  2Q(  )  (9) 

slW 

Solving  (8)  and  (9)  for  sample  size  n,  we  obtain 


n  > 


(gj I  +  W-1^)]2 

Ax 2 


(10) 


Since  too  large  sample  size  implies  a  waste  of  resources,  and 
too  small  sample  size  will  diminish  the  utilization  of  results. 
In  this  paper,  we  exploit  the  smallest  value  that  satisfies  (10)  to 
specify  the  value  of  sample  size  during  information  sensing, 
so  that  we  can  acquire  enough  samples  to  meet  users’  pre¬ 
specified  target  precision,  and  reduce  energy  consumption  for 
data  sensing. 


IV.  Simulations  and  Performance  Evaluation 

In  our  simulations,  100  nodes  are  randomly  deployed  in  an 
area  of  10  x  10m2,  and  sensing  range  for  individual  nodes 
is  lm.  We  run  Monte  Carlo  simulations  to  remove  the  ran- 
domicity  of  simulation  results.  We  evaluate  the  performance 
of  our  extending  work.  In  all  experiments,  we  assume  that 
the  true  value  is  known  to  us.  We  define  the  root  mean 
square  error  (RMSE)  to  express  the  error  of  query  results, 


Fig.  2.  Error  Corresponding  to  Query  Result  Caused  by  Radius  of  Covering 
Disk 


i.e.,  RMSE  =  ^/E£_j  (fij  -  v)2.  Here,  k  is  the  number  of 
Monte  Carlo  simulation. 

For  MAXIMUM,  MINIMUM  and  AVERAGE  data  aggre¬ 
gation  operations,  we  check  the  error  caused  by  the  size  of  k- 
partial  set  cover  disk  and  the  sample  size.  In  Fig.  2  and  Fig.  3, 
we  plot  radius  and  sample  size  versus  error  corresponding  to 
query  results  separately.  We  can  see  that  with  radius  of  k- 
partial  set  cover  disk  increasing,  the  error  of  query  results, 
expressed  by  root  mean  square  error  (RMSE),  is  enlarged 
around  tens  times.  Large  sample  size  can  make  the  acquired 
query  results  be  more  close  to  true  values,  i.e.,  the  RMSE  for 
30  sample  size  scenario  is  only  30%  to  the  one  for  2  sample 
size  scenario.  Moreover,  the  impact  of  radius  of  disk  and 
sample  size  for  AVERAGE  data  aggregation  is  much  smaller 
than  for  MAXIMUM  and  MINIMUM  data  aggregations.  The 
reason  is,  for  normal  random  variable,  the  mean  operation 
for  all  samples  can  counteract  some  errors  during  estimation. 
Furthermore,  we  note  that  the  impact  from  PSF  on  query 
answers  is  much  bigger  than  the  impact  from  sample  size. 

Based  on  (2)  and  (10),  we  acquire  the  optimum  values  for 
radius  of  k-partial  set  cover  disk  and  sample  size,  as  shown 
in  Table  I.  Observe  that  in  order  to  monitoring  same  region 
interested,  the  number  of  active  node  needed  when  p  equals 
to  0.95  is  2  times  when  p  equals  to  0.5.  In  addition,  more 
samples  are  needed  to  satisfy  the  predefined  precision  on 
query  answers,  such  as  the  sample  size  for  p=0.95  is  10  times 
to  p=0.5.  Note  that,  there  is  a  tradeoff  between  the  energy 
consumption  and  the  quality  of  answer. 

V.  Conclusions 

Query  processing  has  been  studied  extensively  in  traditional 
database  systems.  But  few  of  existed  methods  can  be  directly 
applied  to  wireless  sensor  database  systems  due  to  their 
characteristics,  such  as  decentralized  nature,  limited  compu¬ 
tational  power,  imperfect  information  recorded,  and  energy 
scarcity  of  individual  sensor  nodes.  In  this  paper,  we  extend 


Fig.  3.  Error  Corresponding  to  Query  Results  Caused  by  Sample  Size 
TABLE  I 

OPTIMUM  VALUES  FOR  RADIUS  AND  SAMPLE  SIZE  TO  GUARANTEE  QUERY 
QUALITY. 


p=0.95 

p=0.85 

p=0.75 

p=0.5 

Radius  (meter) 

1.0079 

1.0732 

1.217 

2.1933 

Sample  Size  (readings/sample) 

10.564 

5.6987 

3.6391 

1.2511 

our  previous  work:  quality-guaranteed  and  energy-efficient 
algorithm  (QGEE)  for  wireless  sensor  database  systems.  We 
introduce  radius  of  covering  disk  from  point  spread  function 
(PSF)  aspect  and  sample  size  for  query  quality  and  energy 
consumption  control.  PSF  introduces  ambiguity  into  query 
answers,  since  the  sensitivity  of  nodes  is  nonuniform  within 
monitoring  region.  Sample  size  determination  refers  to  the 
process  of  determining  exactly  how  many  samples  should  be 
measured  in  order  that  the  sampling  distribution  of  estimators 
meets  users’  pre-specified  target  precision.  In  this  paper, 
we  formulate  the  criteria  to  determine  the  optimum  radius 
and  sample  size  according  to  users’  requirements  on  query 
answers. 
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Abstract —  Starting  from  the  characteristics  of  newly  appeared 
wireless  sensor  networks,  such  as  decentralized  nature,  limited 
computational  power,  imperfect  information  recorded,  and  en¬ 
ergy  scarcity  of  individual  sensor  nodes,  we  have  done  some 
works  to  solve  energy  constraint  and  quality  required  problems 
from  active  nodes  election,  information  collection  to  query 
answer  expression  perspectives.  In  this  paper,  extending  our 
previous  works,  we  propose  two  methods  to  substitute  cosine 
measure  for  vector  similarity:  Cosine-Length  Measure  (CLM) 
and  Joint-Deference  Measure  (JDM).  Through  considering  the 
impact  of  vector  length  on  vector  similarity,  CLM  alleviates  the 
disadvantage  of  traditional  VSM,  in  which  the  confidence  of 
query  answer  may  be  degraded  since  truly  similar  nodes  cannot 
be  elected  according  to  users’  requirement.  JDM  upgrades  the 
accuracy  and  degrades  the  complexity  for  the  computation  on 
similarity  coefficient  through  simplifying  the  measure  from  vector 
domain  to  scalar  domain.  In  addition,  with  the  distributions 
of  measurement  error  and  environment  noise  known  and/or 
unknown  respectively,  we  formulate  the  criteria  to  determine  the 
optimum  sample  size  to  meet  users’  prc-specified  target  precision. 
Through  simulation,  we  check  the  validities  and  sensitivities  of 
cosine  measure,  CLM  and  JMD  methods  on  answer  quality  and 
network  lifetime.  Furthermore,  our  simulation  results,  in  this 
paper,  form  a  set  of  criteria  for  method  selection  based  on  specific 
applications. 

I.  Introduction 

Recent  developments  in  integrated  circuit  technology  have 
allowed  the  construction  of  low-cost  sensor  nodes,  which 
are  generally  equipped  with  sensing  capabilities,  wireless 
communication  and  limited  power  supply,  CPU  and  memory. 
These  devices  are  expected  to  be  embedded  into  environment 
to  create  very  dense  networks[l].  High  level  tasks,  such  as 
monitoring  specific  events,  are  accomplished  by  cooperation 
of  multiple  nodes  to  collect  and  process  information.  Wireless 
sensor  networks  (WSNs),  which  operate  in  unattended  mode, 
represent  an  emerging  new  type  of  network.  WSNs  are  in¬ 
tended  for  a  broad  range  of  environmental  sensing  applications 
from  weather  data  collection  to  vehicle  tracking  and  habitat 
monitoring[2][3]. 

The  goal  of  monitoring  through  sensor  nodes  is  to  infer 
information  about  objects  from  measurements  made  from 
remote  locations.  Since  inference  processes  are  always  less 
than  perfect,  there  is  an  element  of  uncertainty  regarding 
answers.  When  viewed  from  this  perspective,  the  problem  of 
uncertainty,  which  stands  for  the  quality  of  query  answers,  is 
central  to  monitoring  applications.  Thus,  to  build  useful  infor¬ 
mation  systems,  it  is  necessary  to  learn  how  to  represent  and 


reason  with  imperfect  information  effectively  and  efficiently. 

Uncertain  information  is  typically  handled  by  attaching  a 
number,  which  represents  a  subjective  measure  of  an  uncertain 
element  according  to  some  observers,  to  the  answer.  The  way 
in  which  the  number  is  manipulated  depends  upon  the  theory 
that  underlies  the  number.  There  are  possibilistic  databases[4] 
and  probabilistic  databases[5]  [6]  [7] .  Moreover,  probabilistic 
approach  has  started  to  be  used  by  WSNs  to  process  query 
with  limited  information[8]. 

Most  of  algorithms  for  determining  query  processing  strate¬ 
gies  in  WSNs  are  static  in  nature.  In  [9],  Bodorik  proposed 
aborted  join  last  (AJL)  method  to  substitude  static  mechanism 
with  adaptive  one  which  owns  low  overhead  delay  to  decide 
when  to  correct  a  strategy.  Acquisitional  query  processing 
(ACQP)[10],  Compared  with  typical  methods,  focuses  on 
betaking  the  significant  new  query  processing  opportunity  that 
arises  in  WSNs:  smart  sensor  nodes  have  the  capability  to 
control  over  where,  when,  and  how  often  data  is  physically 
acquired  (i.e.,  sampled)  and  is  delivered  to  query  processing 
operators. 

In  energy  constraint  and  quality  required  problems  for  query 
processing,  we  have  done  some  works  from  active  nodes 
election,  information  collection  to  query  answer  expression 
perspectives.  QGEE[12]  utilizes  in-network  query  processing 
method  to  task  wireless  sensor  database  systems  through 
declarative  queries  and  uses  confidence  interval  strategy  to 
determine  the  closeness  of  a  query  answer  to  the  true  value. 
However,  vector  space  model  (VSM)  is  one  of  very  efficient 
methods  to  quantify  the  correlation  between  a  query  and 
all  candidate  documents,  but  traditional  cosine  measure  for 
similarity  calculation  is  not  a  method  that  reflects  the  similarity 
among  vectors  accurately  or  completely  (i.e.,  only  from  the 
angle  aspect  to  express  similarities).  Moreover,  sample  size 
determination  is  another  important  issue  for  energy  reservation 
and  quality  control.  “How  many  measurements  should  be 
included  in  one  sample  (any  subset  of  a  population)?”  is  an 
essential  question  to  be  answered. 

In  this  paper,  we  propose  two  methods  to  substitute  cosine 
measure:  Cosine-Length  Measure  (CLM)  and  Joint-Deference 
Measure  (JDM).  Through  considering  the  impact  of  vector 
length  on  vector  similarity,  CLM  alleviates  the  disadvantage 
of  traditional  VSM,  in  which  the  confidence  of  query  answer 
may  be  degraded  since  truly  similar  nodes  cannot  be  elected 
according  to  users’  requirement.  JDM  upgrades  the  accuracy 
and  degrades  the  complexity  for  the  computation  on  similarity 


2 


coefficient  through  simplifying  the  measure  from  vector  do¬ 
main  to  scalar  domain.  In  addition,  knowing  and/or  without 
knowing  the  distributions  of  measurement  error  and  environ¬ 
ment  noise  respectively,  we  formulate  the  criteria  to  determine 
the  optimum  sample  size  to  meet  users’  pre-specified  target 
precision.  Moreover,  we  check  the  validities  and  sensitivities 
of  cosine  measure,  CLM  and  JMD  methods  for  quality  and 
network  lifetime  through  simulation.  The  most  important  thing 
is  that,  the  availability  for  those  simulation  results  is  not 
limited  to  the  scenario  for  our  specifical  simulations.  Since 
during  the  design  of  query  vector,  we  remove  the  dependence 
on  the  absolute  values  related  with  network  scenario  and  query, 
our  results,  in  this  paper,  form  a  set  of  criteria  for  method 
selection  based  on  specifical  applications. 

The  remainder  of  this  paper  is  organized  as  follows.  Sec¬ 
tion  II  summarizes  our  previous  work  on  QGEE  algorithm. 
Our  extending  works  are  discussed  in  Section  III.  Simulation 
results  are  given  in  Section  IV.  Section  V  concludes  this  paper. 

II.  Quality-Guaranteed  and  Energy-Efficient 
(QGEE)  Query  Procession  Protocol 

QGEE  employs  an  in-network  query  processing  method  to 
task  networks  through  declarative  queries,  which  is  critical 
for  reducing  network  traffic  when  accessing  and  manipulating 
sensor  data.  In  QGEE  algorithm,  only  a  subset  of  nodes  within 
a  network  will  be  chosen  to  acquire  readings  or  samples 
corresponding  to  the  fields  or  attributes  referenced  in  queries. 
The  goal  of  our  approach  is  to  reduce  interference  coming 
from  measurements  with  extreme  errors  and  to  minimize 
energy  consumption  by  providing  service  that  is  considerably 
necessary  and  sufficient  for  the  needs  of  application. 

From  four  aspects,  QGEE  implement  quality  control: 

•  Query  Vector  Space  Model  (VSM)  Design  and  Active 
Nodes  Selection:  In  QGEE,  VSM  is  employed  by  us  to 
combine  all  considering  factors  to  select  the  most  related 
nodes  to  participate  query  processing.  A  query  correlation 
is  designed  to  express  the  correlation  between  each  node 
and  a  query.  Moreover,  query  correlation  is  computed  by 
nodes  through  a  distributed  way,  and  active  nodes  are 
chosen  locally  leveraging  cooperations  among  nodes. 

•  Optimal  Location  Determination:  We  model  the  problem¬ 
determining  optimal  locations  for  a  query,  as  a  k-partial 
set  cover  problem.  It  is  a  NP  problems  which  requires 
time  that  is  exponential  to  the  problem  size.  We  exploit 
an  approximation  algorithm:  SETCOVER[13],  which  can 
acquire  the  solution  during  polynomial  time,  to  determine 
the  value  of  k  and  the  locations  of  these  k  disks  on  a 
plane.  In  QGEE,  we  choose  centers  of  those  k  disks  as 
our  optimal  locations. 

•  Semi-Manufactured  Query  Answer  Acquisition:  Since  a 
statistic  measurement  on  samples  can  rarely,  if  ever,  be 
expected  to  be  exactly  equal  to  a  parameter,  it  is  important 
that  an  estimation  is  accompanied  by  a  statement  which 
describes  the  precision  of  this  estimation.  We  utilize 
confidence  intervals[14]  to  state  both  how  close  the  value 
of  a  statistic  being  likely  to  be  value  of  a  parameter  and 
the  chance  of  being  close. 


•  Information  Collection:  After  active  nodes  are  chosen, 
a  data  centric  routing  algorithm,  EM-GMR[15]  is  em¬ 
ployed  by  QGEE,  which  is  a  multipath,  power-aware  and 
mobility-aware  routing  scheme.  It  is  used  to  establish 
route-tree  from  active  nodes  to  front-end  nodes  for  query 
answer  return.  EM-GMR  uses  reactive  networking  ap¬ 
proach,  in  which  it  finds  a  route  only  when  a  message 
is  to  be  delivered  from  source  to  destination.  EM-GMR 
considers  distance,  remaining  battery  capacity,  and  mo¬ 
bility  of  each  sensor  node  during  route  path  setting  up. 
This  scheme  could  tremendously  reduce  frame  loss  rate 
and  link  failure  rate  since  mobility  is  considered,  so  that 
incompleteness  information  caused  by  poor  link  quality 
can  be  reduced  at  certain  degree. 

In  energy  consumption  control,  first,  in  the  query  SVM  design, 
node  location  is  included  besides  measurement  quality  and 
remaining  battery  capacity,  since  it  is  directly  related  to  the 
necessary  number  of  active  nodes  to  cover  whole  monitoring 
region.  Through  solving  optimal  location  problem,  we  can 
employ  as  few  as  possible  nodes  to  cover  as  large  as  possible 
monitoring  region  in  order  to  carry  out  energy  reservation 
task.  Second,  we  tremendously  reduce  the  frame  loss  rate 
and  link  failure  rate  through  choosing  more  suitable  nodes 
to  set  up  route-tree  for  queries.  With  this  improvement,  we 
can  reduce  energy  consumption  for  route-tree  maintenance  and 
information  retransmission. 

III.  Extending  Works  for  QGEE 
A.  Query  Vector  Space  Model  (VSM)  Redesign 

In  information  retrieval,  VSM  is  one  of  very  efficient 
methods  to  quantify  the  correlation  between  a  query  and  all 
candidate  documents.  If  we  treat  all  sensor  nodes  as  candidate 
documents  for  a  query,  the  correlation  between  a  query  and 
nodes  can  be  determined  by  utilizing  the  same  principle  in 
information  retrieval.  But  we  should  redesign  the  VSM  vector. 
Following  factors  are  considered  by  us: 

•  Location 

Given  a  piece  of  space,  number  and  location  of  nodes 
determine  the  monitoring  coverage.  In  order  to  employ 
as  few  as  possible  nodes  to  cover  as  large  as  possible 
area,  we  should  select  those  nodes  located  at  some  special 
locations,  called  optimal  locations. 

•  Measurement  Quality 

Since  the  cost  and  the  measurement  quality  of  sensor 
node  are  related  to  each  other,  sensor  nodes  owning 
various  qualities  are  always  deployed  simultaneously  in 
a  WSN  for  economical  reasons.  Furthermore,  through  a 
query,  database  users  supply  not  only  what  information 
they  are  interested  in,  but  also  the  expectation  on  query 
answers’  quality,  i.e.,  the  confidence  of  query  answers. 
In  this  case,  we  should  select  suitable  nodes  to  response 
queries. 

•  Battery 

Remaining  battery  capacity  of  sensor  node  is  the  third 
factor,  but  not  the  least  important  one.  When  the  battery 
of  a  node  is  used  up,  the  uncertainty  of  query  answer, 
at  some  degree,  will  increase  since  data  collected  by  this 
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node  is  missed.  It  inspires  us  to  select  those  nodes  with 
as  high  remaining  battery  capacity  as  possible,  so  that  all 
expected  information  can  be  collected  with  best  effort. 

We  employ  VSM  to  incorporate  above  all  factors  into  the 
selection  of  the  most  related  nodes  for  query  processing.  We 
call  those  nodes  active  nodes.  Vector  T  is  designed  as  T  = 
{LC,  QC,  RBC}  to  represent  each  candidate  node. 

»  LC  stands  for  the  location  correlation.  It  is  the  indicator 
of  the  distance  between  the  location  of  a  sensor  node  at 
(x,y)  and  the  optimal  location  at  ( xo,yo ).  Smaller  value 
of  LC  indicates  a  node  is  closer  to  a  optimum  location. 

It  is  defined  as  LC  =  Ts)-~^v  y°)  .  R  is  a  constant 

to  uniform  LC.  Usually,  we  choose  nodes’  sensor  ranging 
as  R. 

•  QC  stands  for  the  measurement  quality  correlation.  It  is 
defined  as  QC  =  1  -  MEQ.  MEQ  is  the  confidence 
of  measurement  bias.  For  example,  for  speed  detecting 
sensor  nodes,  CXM539  [16],  the  bias  is  ilmGauss  and 
owns  0.95  confidence.  In  this  case,  MEQ  =  0.95. 

•  RBC  stands  for  the  remaining  battery  capacity  correla¬ 
tion.  It  is  defined  as  RBC  =  btST~bt  .  BT  is  the 

&  +  max 

remaining  battery  level.  BTmax  is  the  biggest  value  for 
BT  can  be. 

As  we  know  that,  the  typical  composite  vector  similarity 
measure  is  the  cosine  measure,  which  represents  the  cosine  of 
the  angle  between  a  query  and  a  document  as  shown  in  (1). 

Era 

.  ,  Vi  •  Wa 

Simvs(q,Xi)  =  w  -  -Ai  =  (1) 

VA”. <»’>•£,">  K-> 

where  m*  is  the  number  of  unique  terms  in  a  document 
collection.  Document  weight  Wij  and  query  weight  Vj  are 

Wij  —  fijWij  =  fijlog(N/dj)  and 

{log(N/dj )  yjisaterminq 
0  otherwise. 

However,  we  found  that,  besides  the  angle  between  two 
vectors,  the  length  difference  of  vectors  (the  length  of  vector 
h  is  defined  as  |  h  |)  impacts  the  similarity  of  two  vectors  as 
well.  We  try  to  compare  the  similarity  among  several  vectors 
in  a  2-dimension  space,  noted  by  bold  letters  A,  B  and  C  (See 
Fig.  1(a)).  Observe  that  the  angles  between  A  and  B,  C  and  B 
are  same,  noted  as  angle  6.  Based  on  the  concept  of  cosine 
measure  for  vector  similarity,  hence,  A  and  C  have  same 
similarity  coefficient  with  B.  Therefore,  the  cosine  measure 
cannot  identify  vectors,  which  own  same  angle  to  the  fiducial 
one.  In  this  case,  the  function  blind  area  of  cosine  measure 
will  impact  the  performance  of  VSM.  That  is,  the  confidence 
of  query  answer  may  be  degraded  since  truly  similar  nodes 
cannot  be  elected  according  to  users’  requirement. 

Note  that,  cosine  measure  is  not  a  method  that  reflects  the 
similarity  among  vectors  accurately  or  completely  (i.e.,  only 
from  the  angle  aspect  to  express  similarities).  In  this  paper,  we 
propose  two  substitute  methods  to  alleviate  this  disadvantage 
of  traditional  VSM.  They  are:  Cosine-Length  Measure  (CLM) 
and  Joint-Deference  Measure  (JDM).  CLM  is  a  modification 


Fig.  1 ,  (a)  2-dimension  space  for  vector  similarity  and  (b)  3-dimension  space 
for  vector  similarity 


of  cosine  measure,  which  takes  the  length  difference  between 
two  vectors  into  account  for  similarity  coefficient  calculation. 
There  are  two  ways  to  implement  CLM.  One  is  that  we  project 
the  fiducial  vector  onto  the  measured  vectors  to  calculate  the 
length  difference  of  vectors.  As  shown  in  Fig.  1(a),  fiducial 
vector  B  is  projected  onto  the  measured  vector  A  and  C 
individually,  then  ad  stands  for  the  length  difference  between 
A  and  B,  as  well  as  cd  stands  for  the  length  difference  between 
C  and  B.  We  call  this  method  CLM  I.  The  other  is  that  we 
project  the  measured  vector  onto  the  fiducial  vector.  As  shown 
in  Fig.  1(a),  A  and  C  are  separately  projected  on  B,  then  be 
stands  for  the  length  difference  between  A  and  B  and  fb  stands 
for  the  length  difference  between  C  and  B.  We  call  this  method 
CLM  II. 

Considering  that  a  vector  is  a  directed  quantity  that  can 
be  resolved  into  components/elements  and  the  similarity  of 
vector  is  the  combination  of  all  elements,  JDM  separately 
measure  the  similarity  on  each  direction/element  through 
measuring  length  difference,  then  linearly  combine  them 
together  to  achieve  the  joint  similarity  to  act  as  the  similarity 
coefficient.  For  instance,  as  shown  in  Fig.  1(a),  a'b'  stands  for 
the  length  difference  between  A  and  B  at  x  axis,  and  a"b" 
stands  for  the  length  difference  at  y  axis.  Similarly  for  C  and 
B,  c'b'  and  c"b"  stand  for  the  length  difference  at  x  and  y 
axis  respectively. 


1)  Cosine-Length  Measure  I  (CLM  I): 


SCclmI(Q,  T) 


-{cos6  +  di(Al)} 


(i  +  My 


£ 


M1  +  & 


j=irjQj 


?£*<?'■ 
i  3H3 


H0  is  held 

(3) 

otherwise 


where  H0  is  |  T  |>|  Q  |  . 7§£=f=f ■  #i(Al)  is  the 

part  corresponding  to  the  length  difference  A l  of  vectors. 
In  this  paper,  t  equals  to  3  based  on  the  definition  on 
query  vector  T.  6  6  [0,  f  ],  since  LC,  QC  and  RBC 
are  all  positive  numerics. 

For  the  fli(Al)  term  in  (3),  we  exploit  the  uniformed 
length  difference  between  the  node  vector  T  and  the 
query  vector  Q  at  T’s  direction.  In  CLM  I,  opposite 
to  cosine  measure,  even  though  the  angle  of  measured 
vectors  to  a  fiducial  vector  might  be  equal,  the 
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difference  introduced  by  length  is  not  always  identical 
too.  Therefore,  CLM  I  method  can  successfully  remove 
the  function  blind  area  of  cosine  measure,  For  instance, 
shown  in  Fig. 1(a),  SCcim  of  A  does  not  equal  to  C’s 
any  more  based  on  CLM  I,  since  obviously  the  length 
difference  da  for  A  and  B  is  shorter  than  dc  for  C  and  B. 

2)  Cosine-Length  Measure  II  (CLM  II): 

Similarly  to  CLM  I,  but  we  project  the  measured  vectors 
onto  the  fiducial  vector.  Then  the  similarity  coefficient 
calculation  is  modified  as  in  (4). 

SCcimii(Q,~£)  =  ~{cos6  + ’du(Al)} 


TABLE  I 

Vector  Similarity  Measurement  Using  Cosine  Measure, 
Cosine-Length  Measure  and  Joint-Difference  Measure. 


sc 

Q 

A 

B 

C 

D 

(x,  y)  (m) 

(10,10) 

(10,10) 

(10,10) 

(10,10) 

(10,10) 

Measure  Quality(%) 

90 

90 

98.04 

75 

95 

Battery  Level  (J) 

5.0 

4.5 

4.902 

3.751 

4.5025 

LR  (m) 

0.0 

0.0 

0.0 

0.0 

0.0 

MQ  (%) 

0.1 

0.1 

0.0196 

0.25 

0.05 

RBL  (J) 

0.0 

0.1 

0.0196 

0.2448 

0.0995 

COSINE 

- 

0.7071 

0.7071 

0.7071 

0.449 

CLM  I 

- 

0.6036 

0.6386 

0.4538 

0.4261 

CLM  II 

- 

0.8536 

0.4516 

0.6415 

0.4745 

JDM 

- 

0.9667 

0.9637 

0.8612 

0.9483 

K1  + 151 
M1  + 


§===  Ho  is  held 


Here  H0is\Q  |>|  T 


=irj  <h 


CLM  II  can  gain 


the  same  advantage  as  CLM  I,  since  obviously  the  length 
difference  eb  for  A  and  B  is  shorter  than  ef  for  C  and  B. 


3)  Joint-Deference  Measure  (JDM): 


1  * 

SCjdm(Q,  T)  =  -(^2  SI  Mi) 


i=l 


JDM  simplifies  the  problem  of  measuring  vector  sim¬ 
ilarity  into  measuring  scalar  similarity,  i.e.,  SCjdm  is 
decomposed  into  amount  of  parts  -  SI  Mi  along  indi¬ 
vidual  elements  of  vector.  The  factor  j-  in  front  of  (5) 
comes  from  the  equal  gain  combination  combination 
for  acquiring  the  joint  similarity  according  to  all  vector 
elements.  In  this  paper,  based  on  the  query  vector  design, 
(5)  is  specified  as  in  (6). 


(4) 

otherwifye^fi  e  are  same  (j  g.,  SC=0.707 1 ).  However,  CLM  and  JDM 
methods  can  successfully  identify  the  similarities  to  query  q 
for  noes  a  ~  d  individually.  We  let  A,  B  and  C  has  the 
same  angle  with  Q,  but,  the  distance  from  B  to  Q  is  the 
shortest  one  among  A,  B  and  C  at  node  vectors’  direction, 
and  the  distance  from  A  to  Q  is  the  shortest  one  among  A, 
B  and  C  at  q’s  direction.  Note  that,  node  b' s  SC  is  bigger 
than  a  and  c’s  for  CLM  I  (i.e.,  0.6386  >  0.6036  and  0.4538) 
and  node  a’s  SC  is  bigger  than  b  and  c’s  for  CLM  II  (i.e., 
0.8536  >  0.4516  and  0.6415).  Furthermore,  we  let  the  angle 
from  D  to  Q  is  bigger  than  the  one  from  A  to  Q,  while 
the  distance  from  D  to  Q  is  same  as  A  to  Q.  Observe  that, 
node  a’s  SC  is  larger  than  node  d’s  for  CLM  I  and  CLM 
II  (i.e.,  0.6036  >  0.4261  and  0.8536  >  0.4745).  All  above 
observations  are  consist  with  our  analysis  when  we  designing 
CLM  and  JDM  methods. 


(5) 


SCjdm(Q,T)  = 


-{SIMia  +  SIMqc  +  SIMrbc) 


h 


LC  -  LCa 


LC0 


+ 


QC  ■ 


QC, 


+ 


RBC  -  RBCq 
RBC' 


l> 


where  SIMic,  SIMqc  and  SIMrbc  are  the  similarities 
between  a  node  and  a  query  at  the  location,  measurement 
quality  and  battery  aspects  separately. 

Note  that,  SCjdm  is  a  linear  combination  of  similarities 
at  all  elements.  Instead  of  treating  each  element  equally, 
we  can  easily  change  the  weights  of  elements  to  imple¬ 
ment  adaptive  combination  based  on  users’  expectation. 
That  can  be  a  future  work  of  us. 

To  evaluate  the  performance  of  our  CLM  and  JDM  methods, 
we  set  up  a  network  scenario,  in  which  there  are  4  candidate 
nodes  a  ~  d  for  a  query  q.  Query  vectors,  noted  by  A  ~  D 
and  Q,  for  all  nodes  and  query  are  shown  in  Table  I.  We  use 
cosine  measure,  our  CLM  and  JDM  to  calculate  their  similarity 
coefficients  individually.  The  results  are  shown  in  Table  I. 

Note  that,  cosine  measure  can  only  differentiate  node  d  and 
nodes  a  ~  d’s  similarities  to  query  q,  since  SCs  for  nodes  a, 


B.  Sample  Size  Determination 

We  have  chosen  a  set  of  nodes,  i.e.,  active  nodes,  to  respond 
a  query.  Then,  the  situation  on  the  region  of  interest  can  be 
inferred  from  the  data  obtained  in  a  sample  generated  by 
u2a!ctive  nodes.  While,  “How  many  measurements  should  be 
included  in  one  sample  (any  subset  of  a  population)?”  and 
^‘|Iow  should  the  answer  be  presented  with  more  information 
on  confidence?”  are  two  targets  in  this  Section.  Sample  size 
determination  refers  to  the  process  of  determining  exactly  how 
many  samples  should  be  measured/observed  in  order  that  the 
sampling  distribution  of  estimators  meets  users’  pre-specified 
target  precision  [17].  Moreover,  a  statistic  measurement  on 
samples  can  rarely,  if  ever,  be  expected  to  be  exactly  equal 
to  a  parameter,  thus  it  is  important  that  an  estimation  is 
accompanied  by  a  statement  which  describes  the  precision  of 
this  estimation. 

As  a  matter  of  fact,  nodes’  readings  are  subject  to  many 
small  and  random  errors  which  are  caused  by  limitations 
of  device’s  hardware  and  environmental  noise.  Consequently, 
uncertainty  is  inherent  regarding  to  true  values.  In  this  paper, 
a  node’s  reading  x  is  formulated  as: 


x  =  v  +  em+V 


(7) 


where  v  is  the  true  value,  em  is  the  measurement  error 
introduced  by  limitations  of  device’s  hardware,  and  rj  is  the 
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environmental  noise.  We  assume  em  and  77  are  independent 
random  variables  and  the  mean  for  both  of  them  is  zero. 

Statistical  inference  is  any  procedure  by  which  one  gener¬ 
alizes  to  a  population  from  the  data  obtained  in  a  sample.  In 
the  precision  description  of  the  estimation,  confidence  interval 
(CI)[?]  provides  a  method  of  stating  both  how  close  the  value 
of  a  statistic  being  likely  to  be  value  of  a  parameter  and  the 
chance  of  being  close.  A  Cl  of  an  attribute  denoted  by  Ui 
is  a  interval  [U,hi]  such  that  lt  and  hi  are  real-valued,  and 
that  the  condition  hi  >  h  holds.  In  the  following,  we  discuss 
the  determination  of  confidence  interval  and  sample  size  under 
knowing  or  unknowing  the  distribution  information  separately. 

1)  Knowing  the  distribution  of  em  and  i):  Based  on  the 
Central  Limit  Theorem,  we  assume  77  ~  N(0,  •^tt)  and 
em  ~  N(0,  <j\).  Generally,  in  product’s  technical  datasheet, 
manufactories  supply  the  information  on  measurement  errors. 
For  example,  as  we  mentioned  above,  the  bias  for  CXM539  is 
±1  Gauss  with  0.95  confidence.  That  means  for  sensor  nodes 
CXM593,  o\  =  0.1302.  For  general  cases,  if  we  know  the 
maximum  bias  Ax  and  its  confidence  p,  we  can  obtain  the 
general  expression  of  of  That  is 

9  Ax2 

ol  ~ - 7 -  (8) 


where  Q(x)  stands  for  Q-Function,  defined  as  Q(x)  = 

r  00  1  —uLj 

L  2dv- 

Therefore,  node  reading  also  complies  with  a  Gaussian 
distribution  with  /rx-mean  and  ax -standard  deviation  given  in 
(9).  The  PDF  of  node  reading  fx(x)  is  shown  in  (10). 


px  =  v  and  ox  = 

2^N» 

r-+T 

(9) 

II 

♦— 1 

(X-Vp 

=e  2(-I+^)2 

(10) 

V2*  (^  +  f)2 

To  infer  the  situation  on  the  region  of  interest  from  the 
remote  sensing,  we  do  point  estimates  for  the  population  mean 
v  with  sample  mean  vn  —  £  J2j=i  xi>  which  is  an  unbiased 
estimation  (i.e.,  the  estimator  aims  at  the  true  value  or  is 
correct  average  [18])  through  random  sampling.  Here  n  is 
the  sample  size.  Note  that,  the  sampling  distribution  of  vn 
has  mean  px  and  standard  deviation  fof  Also,  the  quantity 
-Aj-  has  a  normal  distribution.  Consequently,  in  p  percent  of 

%  1—D  &x  %  1  — p  <7x 

the  samples  the  interval  from  vn - to  vn-i — will 

include  the  value  of  v.  Here,  zi^  is  the  lOO^p^  percentile 
obtained  from  the  normal  distribution.  That  is, 

Z  l—p  (7 x.  Z  1  —  p  (J x 

Pr{  Vn - ^0=  <  U  <  Vn  -|-  - ^7=—}>p  (11) 

\Jn  fn 

z  i-p  a  1 

With  (11),  we  obtain  a  bounded  value,  i.e.,  [vn - — , 

vn  -I - ],  which  owns  p  confidence  (not  probability).  We 

call  this  kind  of  query  answers  from  active  nodes  as  “semi¬ 
manufactured”  query  answers. 

The  margin  of  error  of  a  confidence  interval  is  defined  to 
be  the  value  added  or  subtracted  from  the  estimator  which 


determines  the  length  of  the  Cl.  The  length  of  Cl  expresses  the 
amount  of  uncertainty.  Summary,  there  are  four  main  factors 
affect  the  length  of  Cl.  There  are:  sample  size,  sample  mean, 
confidence  level  and  sample  standard  deviation.  In  practice, 
when  one  computes  a  confidence  interval  with  specified  con¬ 
fidence  and  width,  one  must  decide  the  sample  size  to  use.  This 
decision  involves  a  trade-off  between  energy  consumption  and 
estimation  confidence.  In  QGEE,  given  the  margin  Ax  of  error 
between  the  estimator  vn  and  the  true  value  v,  to  specify  our 
capability  for  ensuring  this  error  not  to  be  smaller  than  p,  the 
criterion  for  sample  size  determination  is  stated  as: 

Pr{\vn  -  u|  <  As}  >  p  (12) 


Solving  (12)  and  (1 1)  for  sample  size  n,  we  obtain  (13).  In 
other  words,  with  the  sample  size  determined  by  (13),  we  can 
state  that  the  Cl  will  be  of  length  2  Ax,  or  that  the  precision  of 
the  estimate  is  ±Aa:.  Moreover,  the  sample  size  is  in  inverse 
proportion  to  the  length  of  Cl. 


n  — 


Ax2 


(13) 


2)  Without  knowing  the  distribution  of  em  and  77: 

In  some  cases,  it  is  not  feasible  to  acquire  the  information 
on  the  distribution  of  em  and  77.  The  method  designed  above 
may  not  be  available  any  longer  to  determine  the  confidence 
interval  and  sample  size  in  this  circumstance.  We  extend 
our  work  into  scenarios  with  large  sample  and  small  sample 
individually. 

i  Large  Sample  Scenario: 

Even  though,  we  have  no  idea  on  the  distribution 
of  em  and  77.  The  Central  Limit  Theorem  specifies 
that  i>n  ~  Ar(0,  ~).  Here  s  is  the  sample  standard 
variance.  Since  s  is  almost  certainly  close  to  ox,  the 
quantity  is  approximately  normal  with  mean  0 
and  variance  1.  Therefore,  we  can  safely  substitute 
ox  with  s  in  (1 1)  and  (13),  so  that  to  determine  the 
confidence  interval  and  sample  size  with  specified 
confidence  and  width.  They  are: 

Z  l—p  S  Z  l—p  s 

Pr{Vn  ~  — 7=-  <  V  <  Vn  +  - 7=-}>P  (14) 

fn  y/n 


ii 


n=  [(- 


2l- 


Ax 


-)2J 


(15) 


Small  Sample  Scenario: 

Since,  for  small  sample  size,  s  may  not  be  close  to 
ox,  and  vn  may  not  be  approximately  normal.  But 
the  quantity  vfTv ,  even  though  which  will  not  have 

•y/n 

a  normal  distribution  any  more,  has  the  Student’s  t 
distribution  with  n  —  1  degrees  of  freedom,  which 
we  denote  fn-i-  Consequently,  we  have 


s  s 

Pr{vn~t  l^(n-l)— -=  <  V  <  Vn+t  i=i>(tI-1)— =}  >  p 
2  Jn  2  Jn 

(16) 

The  adequate  sample  size,  which  ensures  the  length 
of  Cl,  i.e.,  [ vn  ti-p  (n  1)-^^,  vn-\ ~t  1  -p  (n— 1)^77], 
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is  2  Ax  with  p  confidence,  should  satisfy  (17). 


Vn  > 

t  l-B  (n  —  1)  —  Ax 
2 


(17) 


C.  Confidence  Computation  on  Query  Answer 

•  MAXIMUM  Aggregation:  In  MAXIMUM  aggregation 
operation,  we  have  design  (1 8)[12]  to  acquire  the  final 
confidence  interval. 


Imax  =  argmnx{li},  and  hmax  =  arg  max{/ii}  (18) 

i  i 

Since  Zmax  =  an?  max*  (£„.),  the  confidence  of  Zmax 
being  covered  by  the  confidence  interval  according  to  (1 8) 
is 


Pmax 


Fzm„Jh  max  max ) 

y/^ii^max  —  /Ae*)\ 


n«( 

i=  1 


_  Y^Q^i{imax  (19) 

tl  °Xi 

•  MINIMUM  Aggregation:  In  MINIMUM  aggregation  op¬ 
eration,  we  have  design  (20)  [12]to  acquire  the  final 
confidence  interval. 


Imin  =  argmm{li},  and  hmin  —  argminl/i, }  (20) 
i  i 

Since  Zmin  =  ar#  min*  (t)n.),  the  confidence  of  Zmin 
being  covered  by  the  confidence  interval  according  to  (20) 
is 


Pmin  Fzjninihmin)  ■^'Zmini^Fnin) 

=  f[{i  -  Q(^luin-~  ^ )} 

-  H{1  (21) 

»= 1  °x' 

•  AVERAGE  Aggregation:  In  AVERAGE  aggregation  op¬ 
eration,  we  have  design  (22)  [12]to  acquire  the  final 
confidence  interval. 


av9~ 

Since  Zavg  =  i  Yh=i  vn; ,  the  confidence  of  Zavg  being 
covered  by  the  confidence  interval  according  to  (22)is 


(22) 


*> avg 


1  ^ 


and  h 


j=i 


IV.  Simulations  and  Performance  Evaluation 

We  run  Monte  Carlo  simulations  to  remove  the  randomicity 
of  simulation  results  and  to  compare  the  performance  of  cosine 
measure  and  our  CLM  and  JDM  methods.  In  our  simulations, 
1 00  temperature  sensor  nodes  are  randomly  deployed  in  a  area 
A  (10  x  10m2).  Individual  sensor  nodes  is  connected  to  other 
nodes  in  its  vicinity  through  wireless  communication  interface, 
and  it  uses  a  multihop  routing  protocol  (i.e.,  EM-GMR[15])  to 
communicate  with  nodes  that  are  spatially  distant.  All  nodes 
are  interconnected  to  a  front-end  node  directly/indirectly. 
Queries  are  injected  into  this  WSN  through  this  query  operator, 
and,  on  the  opposite  direction,  collected  data  are  returned  to 
this  query  operator  to  obtain  query  answer  for  users.  Within 
this  WSN,  each  node  owns  equal  computing  and  sensing 
capability  (i.e.,  sensor  ranging  is  lm),  but  measurement  quality 
for  sensor  parts  may  be  not  identical. 

Assume  the  query  operator  receives  a  total  of  M  data 
samples  originated  from  active  nodes  elected  by  different 
SVM  methods  in  A.  We  utilize  the  data  sample  from  each 
active  node  to  represent  the  true  value  within  the  disk  region 
monitored  by  this  active  node.  We  define  the  reconstruction 
distortion  as  in  (24)  to  evaluate  the  performance  for  individual 
methods. 

1  M 

e = m  E {5(a:)  -  x  e  A  <24> 

i=  1 

Here  S(x)  is  the  source  of  interest  in  A.  When  S(x)  stands 
for  the  answer’s  quality,  reconstruction  distortion  e  stands  for 
the  distortion  between  the  expected  quality  and  the  actually 
achieved  quality  for  query  answers.  The  positive  value  of 
e  means  even  though  higher  quality  answer  is  expected  by 
users  the  confidence  for  answer  actually  achieved  is  some 
poorer/lower  -  negative  distortion,  and  negative  value  means 
more  confident  answer  is  acquired  than  the  expected  one  - 
positive  distortion.  Under  various  expected  qualities  from  0.55 
to  1 .0,  we  obtain  a  branch  of  curves  for  cosine  measure,  CLM 
and  JDM  separately  (See  Fig. 2). 
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Fig.  2.  Reconstruction  Distortion  for  Query- Answer  Quality  Caused  by 
Different  SVM  Methods  Utilized 

Note  that,  CLM  and  JDM  surpass  cosine  measure  in  terms 
of  quality  distortion,  since  the  e  for  cosine  measure  is  around 
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°'53Q55~°5137q  x  100  =  74.42%  larger  than  the  ones  for  CLM 
and  JDM.  That  is,  it  is  more  difficult  for  cosine  measure  to 
make  the  confidence  of  answer  satisfy  users’  expectation  than 
CLM  and  JDM.  We  also  observe  that  CLM  I  can  achieve  more 
confident  answer  when  expected  quality  is  smaller  than  0.75, 
while  wore  answers  when  expected  quality  is  larger  than  0.75 
than  by  CLM  II  and  JDM.  In  addition,  the  stabilities  for  CLM 
II  and  JDM  are  better  than  cosine  measure  and  CLM  I  with 
the  expected  quality  changed,  i.e.,  the  biggest  changes  are  just 
0.0625. 

When  S(x)  stands  for  the  network  lifetime  that  is  defined 
as  the  period  all  nodes  used  up  their  batteries,  reconstruction 
distortion  e  stands  for  the  distortion  between  the  expected 
network  lifetime  and  the  actually  achieved  network  lifetime.  In 
our  simulations,  we  set  the  expected  network  lifetime  50000 
seconds.  Under  various  expected  qualities  from  0.55  to  1.0, 
we  obtain  a  branch  of  curves  for  cosine  measure,  CLM  and 
JDM  separately  (See  Fig. 3). 


Fig.  3.  Reconstruction  Distortion  for  Network  Lifetime  Caused  by  Different 
SVM  Methods  Utilized 

Note  that,  generally  cosine  measure  surpasses  CLM  and 
JDM  in  terms  of  network  lifetime  distortion,  except  that  CLM 

I  achieves  longer  lifetime  when  expect  quality  is  higher  than 
0.9.  In  addition,  cosine  measure  owns  higher  stability  with 
the  change  of  expected  qualities  than  CLM  and  JDM,  i.e., 
the  largest  change  is  only  125  seconds.  Among  CLM  I,  CLM 

II  and  JDM,  it  is  a  watershed  where  expected  quality  equals 
0.62.  That  is,  JDM  can  elect  higher  remaining  battery  nodes 
to  responde  query  than  CLM  I  and  CLM  II  at  the  left  side 
of  0.62,  while  CLM  I  performances  better  than  CLM  II  and 
JDM  for  the  rest  circumstances. 

From  above  simulation  results,  we  note  that  cosine  measure, 
CLM  and  JDM  methods  show  various  sensitivity  for  answer 
quality  and  network  lifetime,  although  we  consider  measure¬ 
ment  quality,  node  location  and  battery  equally.  Hence,  JDM 
method  should  be  the  first  choose  if  answer  quality  is  the  best 
consideration  for  query  processing,  otherwise  cosine  measure 
should  be  the  choose  when  hoping  to  achieve  the  longest 
network  lifetime,  especially  when  the  expected  quality  is 
less  than  0.9.  Therefore,  there  is  a  tradeoff  between  answer 
quality  and  network  lifetime.  We  should  choose  the  SVM 


methods  based  on  the  specifical  applications.  For  instance, 
when  expected  quality  is  not  smaller  than  0.8,  CLM  I  can 
be  used  to  reach  the  middle  point,  which  achieves  not  only 
some  higher  confidence  answer,  but  also  some  longer  network 
lifetime. 

In  Table  II,  we  compare  the  answer  qualities  for  MAX¬ 
IMUM,  MINIMUM  and  AVERAGE  data  aggregations.  In 
addition,  we  also  check  the  impact  of  sample  size  (from  5 
to  100)  on  answer  quality,  pi  is  the  probability  that  the  true 
answer  locate  within  the  confidence  interval  obtained  through 
(18),  (20)  and  (22).  p2  is  computed  through  the  PDF  acquired 
through  (19),  21)  and  (23). 

TABLE  II 


MAXIMUM 

MINIMUM 

AVERAGE 

Pi 

P2 

Pi 

P9 

Pi 

....  Vi 

5 

0.955 

0.95825s 

0.965 

0.95839 

0.747 

0.73641 

20 

0.955 

0.96017 

0.96 

0.96068 

0.981 

0.9746 

50 

0.96 

0.96123 

0.964 

0.9611 

1.0 

0.99959 

100 

0.955 

0.96118 

0.97 

0.96217 

1.0 

1.0 

Note  that,  the  very  small  difference  between  pi  and  P2,  even 
at  various  sample  size  and  data  aggregation,  demonstrates  the 
correctness  of  confidence  interval  and  PDF  of  measurement 
we  designed. 

V.  Conclusions 

Starting  from  the  characteristics  of  newly  appeared  wire¬ 
less  sensor  networks,  such  as  decentralized  nature,  limited 
computational  power,  imperfect  information  recorded,  and 
energy  scarcity  of  individual  sensor  nodes,  we  have  done  some 
works  to  solve  energy  constraint  and  quality  required  problems 
from  active  nodes  election,  information  collection  to  query 
answer  expression  perspectives.  In  this  paper,  extending  our 
previous  works,  we  propose  two  methods  to  substitute  cosine 
measure  for  vector  similarity:  Cosine-Length  Measure  (CLM) 
and  Joint-Deference  Measure  (JDM).  Through  considering  the 
impact  of  vector  length  on  vector  similarity,  CLM  alleviates 
the  disadvantage  of  traditional  VSM,  in  which  the  confidence 
of  query  answer  may  be  degraded  since  truly  similar  nodes 
cannot  be  elected  according  to  users’  requirement.  JDM 
upgrades  the  accuracy  and  degrades  the  complexity  for  the 
computation  on  similarity  coefficient  through  simplifying  the 
measure  from  vector  domain  to  scalar  domain.  In  addition, 
with  the  distributions  of  measurement  error  and  environment 
noise  known  and/or  unknown  respectively,  we  formulate  the 
criteria  to  determine  the  optimum  sample  size  to  meet  users’ 
pre-specified  target  precision.  Through  simulation,  we  check 
the  validities  and  sensitivities  of  cosine  measure,  CLM  and 
JMD  methods  on  answer  quality  and  network  lifetime.  Fur¬ 
thermore,  our  simulation  results,  in  this  paper,  form  a  set  of 
criteria  for  method  selection  based  on  specific  applications. 
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